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A prime concern of the software industries has been 
development of programs which are easy to understand and 
maintain. Through a desire to reduce the perceived complexity 
of programs so as to make them easily understood, this has 
led to various studies directed towards identification of 
factors contributing to psychological complexity of programs 
and development of complexity -reducing methodologies. 

Program understanding may be viewed as two stage 
process. In the first (interpretation) stage, the explicit 
facts about the program are abstracted for mental represen- 
tation. In the second (learning) stage, various meta- 
inf erential techniques are applied to search for rules 
(implicit facts) hidden under the gathered explicit facts 
by suggesting hypothesis for test and eventual confirmation 
into such rules. Several variants of the above model have 
been presented in literature as models of program under- 
standing. 





The psychological complexity of programs will he 
affected by all those factors which either aid or hinder 
the interpretation and learning of the programs. The 
features of the program which suggest an analogy or estab- 
lish an association with a problem domain are expected to 
facilitate learning by reducing the effort in seardh of 
hypothesis. These features of the program are contained 
in comments, variable names, data and control organizations, 
etc., and are collectively termed as ’meaningfulness’ of 
the program. 

The abstraction of facts about a program may involve 
mental expansion of program text in branch free sequences 
of statements or blocks followed by mental execution of 
these blocks. Control structure and data structure are 
expected to play an important role in formation of above 
blocks and their mental execution. They may also aid in 
’meaningfulness' of the program. The limited capacity and 
volatile nature of human short-term memory make the size 
of the program an important factor of complexity. 

This thesis establishes these four algorithm 
dependent features: size, control structure, data structure 
and computation structure of program' as factors of program 
complexity and studies their effect on program understand- 
ing. The thesis also studies the effect of ’meaningfulness' 
of programs on their understanding. 



The formal abstraction of structures of program objects 
or data are claimed to have important bearing on program under- 
standing, Many new programming languages (OIU, A1PHA.RD, etc.) 
have been designed to incorporate the methodology of data 
abstraction. Some formal aspects of data abstraction have 
been studied in the thesis. 

The ma;}or work: of the thesis is outlined below along 
with results and conclusions. 

A four factor judgemental experiment was conducted 
with experienced programmers as subjects to study the con- 
tribution of factors of size, control structure, data struc- 
ture and computation structure to program complexity. These 
factors will be referred as 4-factors of program. Information 
integration theory was applied to identify the underlying rule 
of integration of various factors of program complexity. The 
result of the experiment was: 

(1) Pour factors of the program contributed indepen- 
dently to perceived program complexity. 

A second experiment was conducted to investigate the 
effect of 'meaningfulness^ of the expressions and the number 
of operands in the expressions on their evaluation. The 
'meaningfulness’ of the expressions were due to their 
referential meaning in some problem domain. The results of 
the experiment were: 

(2) It was easier to evaluate meaningful expressions 
than the meaningless ones. 



(3) The evaluation of expression v/as affected ■by- 
number of operands in them. 

The purpose of the third experiment -was to study the 
effect of meaningfulness of the program and number of vectors, 
matrices, etc., referred to as numerosity of data structures, 
on program understanding. The value of meaningfulness -was 
controlled by selecting variable names -which were suggestive 
of their meaning and purpose and familiarity of the subjects 
with the problem. The results were: 

(4) The ’meaningfulness’ of the program facilitated 
their understanding. However, there was one 
interesting exception to this conclusion. 

(5) The numerosity of data structures in the program 
did not affect program understanding significantly. 

The above result (5) and the exception to conclusion (4) 
are against the intuitive expectation. The results of the 
further experiments indicated that these unexpected results 
were due to special fea-tures of the program used in the 
experiment. The uncontrolled fea-ture dominated over the 
controlled factors. 

Two measures of program comprehension were also 
compared in the experiment. 

These above experiments indicated the overwhelming 
strength of Influerrce of ’meaningfulness’ factor-. It was 
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decided to use programs -which were as uniformly meaningful as 
possible in the experiments. Many programming standards of 
real life endeavour to achieve just this situation. 

A 3x3 factorial experiment was conducted to study the 
effects of control and computation structures on program 
understanding. The results of the experiment were: 

(6) factor of computation structure (controlled by number 
of assignment statements and the number of operands 
in each) affected program understanding. 

(7) The numerosity of control statements in the programs 
did not increase their control structure complexity. 
The regularity in the syntax and semantics of the 
control structure of the progam facilitated under- 
standing. 

The above three experiments were 1- and 2-factor 
experiments. The subjects of the experiment 2 and 3 were 
novice programmers. These experiments did not permit the 
study of interaction among the factors. The class-room 
environment and the strict time schedule of the course 
precluded use of large programs and inclusion of more factors 
in the experiment. 

A four factor experiment was conducted with experienced 
programmers as subjects. The results of that experiment were: 



(8) All the factors except program size and some of 
their interactions affected program understanding. 
The differences in the size of the programs vere 
not large enough to affect program understanding. 

In another experiment, the sixteen programs used in 
the previous experiment were subjectively rated for their 
complexity. The results, in general, confirmed the findings 
of the previous experiment. 

The results of the experiments suggest that all the 
four algorithm dependent features of the program along with 
’meaningulness’ of the program affect program understanding. 
Quantitative syntactic measures for complexity of the 
factors do not represent their contribution to psychological 
complexity of program. 

The limited data structuring constructs of FORTRAN 
did not permit the experimentation with various aspects of 
factor of data structure complexity. As an alternative to 
experimentation, different specification techniques for 
data abstraction were studied theoretically and compared 
for their good and bad features. 

(9) We have proved that Parnas' earliest (1972) 
specification technique is powerful enough to 
specify any computable function. Further, it 
has been proved that the completeness and consis- 
tency of the specification are not decidable. 



Finally, some modifications are suggested in the 
above technique to facilitate the specification 
of any exception handling strategy and ' delayed 
effects* . 

The thesis, in the end, also discusses the limitation 
of the present -work and suggests direction for further 
research. 



CHAPTER 1 


IHTROEUOTION 

Program debugging, maintenance and modification are 
major activities of software industries. An observation of 
Elshoff , [ Elshoff ,76a,blina study of General Motor 
installations , that seventy-five percent of all programmers' 
time was spent on maintenance and modification of existing 
programs underscores the importance of above activities. An 
important prerequisite for debugging, maintenance and 
modification of programs is their comprehension. It is 
now widely felt that the complexity of programs should'ibe 
signif icantly reduced to facilitate their understanding. 

This has led to various studies directed towards identifi- 
cation of factors contributing to complexity of programs 
and development of complexity-reducing methodologies. The 
complexity referred above is not the computational complexity 
of the program but its psychological complexity. These two 
complexities should be clearly distinguished. The computational 
complexity characterises the order of time and storage 
requirements of the program. On the other hand, the psycho- 
logical complexity refers to those characteristics of program 
which make human understanding of the program difficult, Ho 
direct relationship is expected between these two types of 
complexity. 
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Psychological complexity of programs is one of the 
concepts which one knows well hy intuition but finds 
difficult to formalise. This difficulty is mainly due to 
absence of formal theory of program understanding. 'N’o 
general model has emerged either from empirical studies 
conducted to identify various factors of program complexity 
or from the research in the field of Artificial Intelligence. 
However, some common features have emerged from various 
models described in the literature. 

In the model proposed by Mills, [Mills, 72], one 
major step in understanding a program text is the mental 
expansion of the text into branch free text seq.uences, 
corresponding to classes of execution using identical 
instruction sequences. A second major step is the mental 
execution of instructions along these branch free sequences. 
These two steps are often intermingled in mental visualisation 
of the process invoiced by the text. 

The model proposed by 3obrow and Brown, [3obrow,75b] , 
consists of two separate programs one to synthesize a model 
of the environment based on input data and other to analyze 
the information in the synthesized model in order to answer 
the questions about the environment. The synthesized model 
of the environment , is the mental representation of the 
external world and its structure depends on the nature of 
the questions to be answered. Analyst retrieves knowledge 
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explicitly represented in the above model and uses various 
inferential techniques to infer facts not explicitly con- 
tained in the model. 

Lofgren, [lofgren, 77], makes distinction between an 
object and a description* The distinction is based on the 
process of learning. In the case of an object, learning 
proceeds by finding the regularity in its structure without 
transforming it to any other object of other domain. In the 
case of description, learning is preceded by transformation 
of the description into object of other domain. Ibis 
distinction between an object and a description is only a 
way of looking at the things. The more regularities or 
structures are found, the more genuine is the learning and 
less complex is the object learned. 

Shneiderman' s syn tactic/semantic model of programmer 
behaviour, [Shneiderman, 77b] , hypothesizes that the 
programming experience and education build two kinds of 
knowledge structure. Syntactic knowledge is language 
dependent, acquired through rote memorization and must be 
frequently rehearsed to prevent forgetting. Semantic 
knowledge is language independent acquired through meaningful 
learning and heirarchically organized from low level 
constructs such as meaning of the assignment statement or 
an arithmetic expression, to intermediate program structures 
such as the sequencje for clearing or sorting an array, to 



4 


higher level problem domain events such as the computation 
of a matrix inverse or a correlation coefficient. 

Brooks, [Brooks, 77] , has presented an explicit model 
for the coding process based on information processing 
theory of ^e-well and Simon, [ Newell, 72] . The theory 
hypothesizes that three distinct sorts of behaviour, under- 
standing, method finding and coding, are involved in 
performing a programming task. Understanding is characteri- 
zed by acquisition of knowledge of the basic elements of 
the problem. These include the objects with which the 
problem is concerned, their properties and relations, 
initial and final state of the objects and the operations 
available for going from initial to final states. Extracting 
information from external sources and building of internal 
representation during which new information is incorporated 
are intermingled in the process of understanding. 

It is evident from the above models that understand- 
ing is viewed as two stage process. We shall call the two 
stages as interpretation and learning stages. In the inter- 
pretation stage, explicit facts about the program are 
abstracted from the text of the program. This may involve 
mental expansion of the text into branch free instruction 
sequences and mental execution of these sequences [ Mills, 72] . 
In the learning stage, one searches for rules (implicit 
facts) hidden in explicit facts by suggesting hypothesis 
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for test and eventual confirmation into such rules. A rule 
is considered more powerful the more explicit facts it 
concerns. 

The psychological complexity of the program will he 
affected by all such factors which either aid or hinder the 
interpretation or learning of the program. Various factors 
such as comments, mnemonic names, paragraphing, different 
control constructs, have been studied for their effect on 
program understanding. Defining a complexity measure on 
the basis of such factors may prove difficult due to their 
numerosity. 

We believe that the various factors affecting the 
program understanding may be synthesized in small number 
of higher level factors. The complexity measure of these 
higher level factors can be defined on the basis of their 
effect on program understanding. 

The major contributions of this thesis are in 
identifying the various factors of program complexity and 
studying their effect on program understanding. 

The features of the program which suggest an analogy 
or establish an association with a problem doraai-n are 
expected to facilitate learning by reducing the effort in 
search of initial hypothesis to characterize the program 
structure. These features of the program are contained in 
comments, variable and subroutine names, data and control 
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organizations, etc. The comments help in establishing an 
association v/ith problem dcnnain .objects and events. Similarly, 
selection of variable names for data and their organization 
are suggestive of the association between program and the 
problan domain. The control structure and execution profile 
of the program may also help in establishing this association. 
These features of the program are collectively termed as 
factor of ’meaningfulness' of the program. 

The control structure and data structure of the 
program are also expected to play a major role both in inter- 
pretation and learning stages of understanding. More complex 
the control structure, more difficult it is to mentally expand 
the program text into branch free sequences of instructions 
and identify them. The mental execution of branch free 
sequences of instructions helps in abstracting the details 
of the program and in establishing a correspondence with the 
events of the problem domain. In the execution, the values 
are retrieved from the structure of the mental object. The 
structure of the mental object is expected to be dependent 
on the data structure of the program. Meaningful variable 
names and organization may aid in structuring the mental 
represen ta tion . 

The other important factor, which will affect 
understanding, is size of the program. There may be some 
reservations in treating size as a separate factor. The 
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increase of control and assignment statements will increase 
the size as well as control and executional complexity of 
the programs. However, due to limited capacity and volatile 
nature of mental storage, size becomes an important factor 
after some threshold level. In a study by Thayer, [ Thayer, 75] , 
it has been shown that program size is correlated with 
occurrences of actual software errors. This observation 
further underscores the importance of size. 

These informal arguments suggest that the size, the 
control structure, data structure, and execution structure 
of a program will affect the program understanding and hence 
the program complexity. The above four factors are algorithm 
dependent. The factor of ' mean ingfuln ess' will also have an 
important bearing on program comprehension. 

This research establishes size, control structure, 
data structure and execution structure of programs as factors 
of program complexity through a judgemental experiment with 
experienced programmers. The above factors will be collec- 
tively referred as "4-factors of program complexity" in 
this thesis. The effect of 4-factors on program understanding 
has also been studied. 

The data abstraction methodology is claimed to 
facilitate the understanding of data structures by formally 
abstracting their properties. Some formal aspects of data 
abstraction have been studied. 
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The next section includes a "brief survey of the 
experiments on program comprehension and different complexity 
metrics. The final section describes the organization of the 
thesis . 

1 . 1 BRIEP SURVEY 

Weissman ,[Weissman ,73,74] , in his experiment studied 
the effects of comments, paragraphing and mnemonic variable 
names on program understanding. The experimen t -was conduc- 
ted with students in a computer science course. Several 
versions of two programs having different levels of factors 
of interest were used in the experiment. Three measures of 
program understanding: (i) how well the subjects hand 
simulated the program, (ii) how well they were able to fill 
in the blanlcs in a paragraph describing a program and 
(iii) the subjects own evaluation of how well he understood 
the program, were used in the experiment. The ranking of 
one’s own understanding of the program was done on a scale 
of 0-9. The results of the experiment indicated the strong 
dependence of program understanding on these three factors. 
Comments and paragraphing facilitated under standing. Mnemo- 
liic variable names made both comprehension and modifi- 
cation easier than non-mnemonic names. However, the mnemonic 
variable names tended to slow down the subjects in hand 
simulation. 4. large number of errors were made in a combi- 
nation where comments were present and mnemonic 
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varialDle names were alDsent. The interaction effect between 
paragraphing and comments was against the intuitive expecta- 
tion. Paragraphing hindered understanding in presence of 
Comments whereas it helped understanding in their absence. 

Ben Shneiderman had investigated several programming 
issues such as program composition, comprehension, debugging 
and modification. In bis early experiment [Shneiderman, 76 ], 
he compared the effect of Logical IP vs. arithmetic IF 
statement of PORTRaH on program understanding. The programs 
constructed using logical IP’s proved easier to understand 
than the programs constructed using arithmetic IP’s for 
novice programmers. However, there seemed to be no 
difference in the two syntactic forms for experienced 
programmers. 

In another experiment he studied the program 
memorization task. Pwo short PORTRA.tr programs were used 
in the experiment. One was a proper executable program 
whereas the other contained valid statements in scrambled 
order. The results indicated that the program structure 
facilitated comprehension and memorization, -^s experience 
increased, the ability to memorize the proper program 
increased rapidly while ability to memorize the shuffled 
program showed minimal change. 

The other results of his series of experiments, were: 
(i) Retailed flowcharts were found to be of no detectable 
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value in program oomposition, comprehension, debugging or 
modif i cat iorJSih neiderraan , 77a] . 

(ii) High level comments were significantly better than 
low level comments in aiding memorization; but in debugging, 
the presence and absence of comments seemed to be irrelevant 
[ Shneiderman , 77b] . 

(iii) In difficult programs, mnemonic variable names 
made comprehension much easier than non-ranemonic names for 
experienced programmers. However, in simple programs it 
did not have any effect. 

(iv) Modular programs were less difficult to comprehend 
than non -modular ones. The random modular programs were 
most difficult to comprehend among the three Shneidemman ,7S . 

(v) In debugging, the subjects performed better with 
indented unfamiliar programs than with non-indented programs 
[Shneiderman, 79] . 

Sheppard et al., [ Sheppard, 7S] , studied the effect 
of mnemonic variable names, program class and structure on 
programmers understanding of programs. Three general classes 
of programs used were: engineering, statistical, and non- 
numeric. Three levels of program structure defined were: 
structured, quasi -stru ctured, and unstructured. The 
structured level adhered strictly to the tenets of struc- 
tured programming [Dijkstra, 72]. Program flow proceeded 
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from top to 'bottom with one entry and one exit. Neither 
bachward transfer of control nor arithmetic IP's were allowed. 
In quasi-structured level, use of ‘backward GO TO statements 
and multiple exi)^fts were permitted. Awkward constructions 
resulting from rigorous application of rules of structured 
programming were eliminated. The unstructured level programs 
contained many GO TO statements. Backward transfer of control 
was not restricted and arithmetic IP's were allowed. The 
results of the experiment indicated that program class and 
program structure affected program understanding while no 
relationship was found for mnemonic variable names. 

Some similar studies were conducted to study program 
construction. Sime, Green and ^est, [ Sime, 73], compared 
the implementation of condit 'Lonals. Subjects were required 
to write five increasingly difficult programs. Subjects 
were divided in two groups. Both the groups used the two 
implementation of conditional alternatively to write 
programs. The group using implementation in a structured 
IP-THM-ELSE manner had less difficulty constructing the 
programs than the group using implementation via tests and 
forward branch statements. 

Brooks' work attempts to develop a theoretical 
frame work for the study of cognitive process involved in 
understanding, method -finding and coding process in program 
writing [Brooks, 77]. His experimental results had 
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demonstrated the sufficiency of the theory in explaining the 
observed programming behaviour in coding process. 

A limitation common to most of the experiments 
surveyed above is that they are all single factor experi- 
ments. These single factor experiments do not permit the 
study of interaction among the factors. The various factors 
studied are not algorithm dependent. 

In parallel v;ith these works, several complexity 

metrics were developed for programs. The first theory was 

proposed by Halstead, [ Halstead, 75], which states that 

programs have measurable characteristics analogous to 

physical laws. He defined the program size in terms of 

four basic measures: (i) number of distinct operators, 

(ii) number of distinct operands, (iii) total number of 

occurrences of operators, and (iv) total number of occurren- 

o— 

ces of operands in the program. The languge level, program- 
ming effort and programming time are also defined on the 
basis of these four measures. The correlation between 
program size and mental effort required to generate the 
program is established on the basis of hypothesis that the 
program is generated by selecting items, such as operators, 
operands, etc., from language vocabulary by means of binary 
search. 

Ronback;, [Ronback, 75], associates a measure with 
control flow of the program to characterize its h^rarchioal 
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structuredness. The structuredness of the system is defined 
in terns of number of levels in the h^rarchy to structure 
the program. An ideal structure which maximizes the number 
of levels would require only two components to make a 
subassembly at the next level. The absolute structuredness 
of a system can be compared to an ideal structure to get the 
ratio of structuredess . The number of absolute levels in the 
system is computed from the control flow graph of the program. 

In defining structuredness, the complexity of the proof of 
the correctness at any node of any level is also taken into 
account. The above measure has not been tested for its 
validity. 

Amster et al.s[Amster, 76], used percent of the 
statements that affect control flow as a measure of complexity. 
However, this measure has a problem because complexity can be 
held constant as the size of the program increases. 

McCabe, [McCabe, 76], has defined the complexity 
measure in relation to the decision structure of the program. 

His complexity metric, V(G), is the classical graph -the ore tic 
cyclomatic number defined as: number of edges - number of 
nodes +nurQber of connected regions. Sheppard et al., 

[Sheppard, 78], evaluated the relationship between 
comprehensibility and three program matrices (i) Halstead’s 
measure of programming effort, (ii) McCabe's cyclomatic 
number of control flow graph, and (iii) the number of 
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statements in the program. It was found that both Halstead’s 
and McCahe's measures were related to program understanding. 

Woodward e t al .,[ Woodward, 79], has forwarded "knots" 
as a measure of control flow complexity of programs. The knot 
is intersection of directional lines representing the flow of 
eontrol. The knots can be defined mathematically as follows. 
If a jump from line a to line b is represented by ordered 
pair of integers (a,b), then jump (p,q.) gives rise to a 
'knot’ with respect to jump (a,b) if either 

(i) min(a,b) < min(p,q)< max(a,b) and max(p.q) > max(a.b) or, 

(ii) min(a,b) <max(p,q)< raax(a,b) and min(p,q)< min(a,b). 

The count 'of number of "knots" in a program gives a measure 
of complexity. 

In our opinion, the measures described above do not 
reflect the psychological complexity of programs.- These 
measures reflect only numerosity aspect of control structure. 
The numerosity may be an important parameter in interpretat- 
ion stage. But in the learning stage the regularity or 
pattern of control structure is more dominant. The regularity 
may exist at statement level, subroutine level or even at 
problem dom^-in level. Hone of these metrices captures the 
difficulty in learning stage. 

Purther, the units used in defining the measures, 
such as operators, operands, basic paths, etc., do not seem 
to represent the cognitive units used by human beings for 
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program understanding. It se-ems more prodatle that people 
use higher level structures as cognitive units in under- 
standing. Hov/ever, v/e feel that these measures are ■well 
suited for program construction tasks because the basic 
units used in the task are same as used for the measures. 

Other limitations of the control complexity measure 
are that they do not take into account the level of nesting 
and the effect of data values on the control flow. Associa- 
tion of data values with control paths seems to be an 
important -part in program comprehension. 

1.2 ORGAT^ISATIQ-N OP THE TI-IESIS 

Chapter 2 of this thesis describes a four factor 
judgemental experiment with experienced programmers. The 
experiment was conducted to find whether the factor of 
size, Control structure, data structure and computation 
structure contribute to overall program complexity. In- 
formation integration theory , [Anderson , 72 ] , was applied 
to identify the underlying rule of integration of various 
factors for overall program complexity. The result 
indicated that 4-factors contribute independently to 
perceived program complexity. 

Three experiments with novice programmers are 
described in Chapter 3. The first experiment studies the 
effects of meaningfulness of the expressions and the 
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number of operands in the expression on their evaluation. 

It is observed that the evaluation of meaningful expression 
is easier than meaningless ones. Further, the expression 
e valuation.is affexrted by number of operands. 

In the second experiment, the effect of factor of 
’meaningfulness’ and data structure complexity on program 
understanding were studied. The meaningfulness of the 
program"' was controlled by selection of variable names 
which were suggestive of their meaning and purpose and 
problems of varying degrees of familiarity for the subjects. 
Results suggest that it was easier to comprehend the meaning- 
ful program. However, there was an exception to this conclu- 
sion. The numerosity of data structures did not show any 
significant effect on program u-Tderstanding. This conclusion 
contradicted the intuitive expectation. However, the 
experiments of Chapter 4 explained the reasons for these 
unexpected results. Two measures of program comprehension 
were also compared. 

The last experiment of the chapter studied the 
effect of control structure and execution structure complexi- 
ties on program understanding. The programs used in the 
experiment were meaningful ones. The execution structure 
complexity affected the program understanding. 

The experiments of Chapter 3 were 1- and 2-factor 
experiments and the subjects were novice programmers. 
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Interactions between factors conld not be studied in these 
experiments. The class-room environment and strict time 
schedule for the course forced the use of small programs 
in the experiment and constrained us from designing multi- 
factor experiments. 

Chapter 4 describes a 4-factor experiment with 
experienced programmers to study the effect of main factors 
and their interactions on program understanding. Another 
experiment was conducted to test the effectiveness of the 
subjective rating of program complexity. 

The results of the experiment emphasized the 
importance of data structure in program understanding. The 
data abstraction methodology is claimed to facilitate under' 
standing of data structures by abstracting their relevant 
properties . 

In Chapter 5, a brief survey of various data 
abstraction techniques is given. It is proved that Parnas’ 
early (1972) specification technique is powerful enough to 
specify any computable function. Purther, the completeness 
and consistency of the specification are proved to be 
undecidable. Some modifications are suggested in Parnas’ 
technique to facilitate the specification of exception 
handli’tg strategy and ’delayed' effects. 
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The last chapter, Chapter G, summarises oonaLusioTis 
from the major works of earlier chapters, discusses the 
limitations of the experiments and suggests possible 
directions of the future research. 



CHA.PTER 2 


PERCEPTIOIT OP PROGRAM COMPLEXITY 

Informal but intuitive arguments -were advanced in the 
previous chapter 'which identi.-'ied size, con trol .structure, 
data structure and execution structure complexities as 
contributing factors of overall program complexity. In this 
chapter an attempt is being made to empirically verify their 
influence. 

The main purpose of the experiment described here 
was to answer the following questions: (a) Do factors of 
size, co'trol structure, data structure and execution struc- 
ture of the program contribute to program complexity?.- and 
(b) How do they contribute to program complexity? Answers 
to these two questions would help design other experiments 
on program understanding. 

The organisation of this chapter is as follows. 

Pirst section contains a brief introduction to information 
integration theory [ Anderson , 72], the second section 
describes experimental method and design, and the final 
section includes analysis of data and discussion of the 
results. 

2.1 lYPORMATIOF ITITEGRATIOM THEORY 

Infomation integration theory is a unified, general 
theory of human judgement. Information refers to details 
about one's surroundings either in coded or uncoded fom 
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on the basis of which judgements are made. Integration 
refers to the process whereby information about several 
'coacting stimuli are combined to produce an overall response 
or judgement. The underlying idea of the theory can be 
described with the help of the diagram in Figure 2.1, 
reproduced from [ -^nderson , 7 ?]. 

In Figure 2.1, , $2 and represent physical 

stimuli or external information available for judgement. 
These stimuli activate various sense organs and get 
converted into psychological stimuli, s-^ , S2, s^. In the 
case of simple sensory stimuli, the conversion process is 
known as psychophysical law* In general, however, it is 
known as ’valuation function’ , V. The psychological values 
s^ , 82, s^ are combined by the individual in making judge- 
ments. This co-ordination process is known as ’integration 
function' , I, and the judgement or response is known as 
’covert response', r. This covert response is transformed 
by response function, M, into 'overt response’, R. 

The integration function, I, is a psychological law 
which relates the internal stimuli to the internal response. 
The response function, M, relates this covert response to 
the overt measurement scale imposed by the investigator. 

If M is a linear function, then R is said to be on interval 
scale . 

In integration theory, simple algebraic models are 
used to describe the integration process, I. These simple 
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FIG.2.1 FUNCTIONAL MEASUREMENT DIAGRAM 
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algebraic models of perception and judgement have given a 
detailed quantitative account of fairly complex cognitive 
activities. 

Most of the models fall in one of the t-wo main 
classes. On© class includes adding, subtracting and 
averaging models; the other includes multiplying and divid- 
ing models. Adding and subtracting are formally similar. 

But they may be psychologically different. Adding and 
averaging are different both mathematically and psychologi- 
cally. Under certain situations, however, they make identical 
prediction and have a very simple analysis. Similar is the 
case with multiplying and dividing models. Mathematically 
they are similar but psychologically they are different. 
Dividing models have been used in comparative judgements, 
while multiplying models have been employed in work on 
utility theory [Shanteau, 72]. 

In algebraic models, two stimulus parameters — scale 
value (_s) and weight (w) , have special importance. The _s is 
considered as the location of stimulus along the dimension 
of judgement. The w is viewed as the relative importance 
of different pieces of information coming into judgemental 
task. Both weight and value are dependent on the dimension 
of the judgement. Bach task sets in a valuation operation 
by virtue of which the stimulus parameters are defined. 
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Since the averaging model of integration process has 
accounted for many facts about human judgements and has 
provided a general basis for estimating weight parameters, 
a brief discussion of the model is given below. In the 
averaging model, the response, R, to a set of stimuli can be 
written as 




R 


= .1 ii/ 


1=0 


N 

I 

i=o 


^-i 


( 2 . 1 ) 


The averaging model is a special case of linear model with 
the constraint that the relative or effective weight, 
w./ y w. , sum to unity. The development of integration 

— “ 1 . , L *“1 

1 = 0 

theory has shown that it is necessary to allow for an internal 
state variable or general impression as well. This is treated 
as another stimulus, S^, with value and weight, s^ and w^, 
respectively. In many cases, represents the integrated 
result of the past experience and the current state in an on- 
going task. However, may also represent disposition as 
well as motivational variables. There are different cases 
of the averaging model, but the simplest is equal weight 
averaging. It is like a linear model and can be handled 
in a simple way. 

Suppose that two stimulus variables, program size 
and its control structure, for example, are thought to 
combine in an additive or linear way to determine the 
overall complexity of the program. The two variables are 
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combined in an ordinary Row x Column factorial design. 
Several levels of one stimulus variable constitute the rows 
of the design and several levels of the other stimulus 
variable constitute the columns of the design. Each cell of 
the design thus corresponds to a pair of stimuli. This 
factorial design can be extended for any number of stimuli. 
The equal weight co-edition requires that all stimuli within 
each factor of the design have the same weight. Eor a two 
factor design, the row stimuli would all have a common 


weight, Wj^, and the column stimuli would all have a common 
weight, W-. When this condition is satisfied, then the 

— — Q 


averaging model reduces to 


R. , = 0 + 

o 


W S + W.n s ^ . + w^ s„. 
— 0 -0 -R -Ri -c -C3 


w + Wq + w„ 

— o — R — G 




! 

+ W S . 


-c -C3 


( 2 . 2 ) 


where R- • is the response due to combination of ith row and 
a 0 

jth column stimuli 0^ is a constant which allows for an 
arbitrary zero on the response scale. Wj^ and w^ are 
relative weights of the row and column stimuli, respectively. 
Equation (2,2) ignores the error variability. The above 
equation implies that data should exhibit a simple pattern 
of parallel lin es [ Anderson , 74b]. 

The validation of integration model is straightforward. 
It requires a test of goodness of fit. If the integration 
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function is validated empirically then this function itself 
can he used as the base and frame for estimating subjective 
value of the stimuli. This idea is known as "functional 
measurement". This approach is used to answer the two 
questions posed at the outset. 

There is a direct relationship between equation(2.2) 
and the structura l model used in analysis of variance 
[Winer, 71]. The graphical prediction of parallelism is 
equivalent to a zero Row x Column interaction. If the 
parallelism is observed, that supports the linear model of 
equation (2.2). Purther, it indicates that the response 
measure is on interval scale. That follows because unequal 
intervals would produce non parallelism even with a correct 
model. Finally, it can also be shown that the row means of 
the design are interval scale estimates of subjective value 
of the row stimuli and similarly for column stimuli. 

Similar estimates can be made from multiplying 
integration rule. Puctional measurement for the multiplying 
model is the same as for the linear model. Multiplying rule 
may be hypothesized as 



0 

0 



( 2 . 3 ) 


If s . were known, then each row of the data would plot as 
cj 

a straight line with a slope s-^^. Or if the Sj^^ were known 
then each column of the data would plot as a straight line 
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with slope s. .. In either case, the data would exhibit the 
form of linear fan. If the model is empirically supported, 
then the column means of the design are interval scale 
estimate of s In practice, therefore, the observed 
column means may be ies used as provisional values of _s . to 
test the linear fan prediction- 

2.2 EXPERIMEtrT DESIGAT 

In this experiment, subjects received information about 
the size, control structure, data structure and execution 
structure complexities of several hypothetical programs and 
gave their judgement about the overall complexity of each 
program. Erom the judgemental data, an integration function 
was hypothesized and empirically verified. This integration 
function in turn helped to assess the contribution of factors 
to program complexity and their interaction. 

Some salient features of the experimental methodology 
are mentioned here. The response scale chosen was a 31-poin.t 
graphic scale to facilitate fine discrimination among the 
programs. A small number of steps in the scale would have 
been susceptible to the end effects, whereas large number 
of steps would have encouraged lumping. It was expected 
that a 31 -point scale wo’Uld give a’noptimal response. The 
other advantages of using graphic scale are that it avoids 
the tendency for 'residual number preferences’ and. eliminate-s 
memory effects [ Anderson , 74a]. 
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To remove end effects and consequent nonlinearity in 
the response scale, stimulus end anchors were used. These 
end anchors were stimuli more extreme in either direction 
than the regular experimental ones. The end anchors were 
intended to enable the subjects to use entire response scale 
and to obtain regular experimental data from the interior of 
the scale. 

As range of the stimuli and the response scale were 
arbitrary, some experience was needed to develop a frame 
of reference for the task. Preliminary practice and end 
anchors were employed to set the subjects frame of reference 
as is usually done in functional measurement. 

To minimize the effect of the order of presentation 
of stimuli, the stimuli were presented in different shuffled 
orders to different subjects. The order and carry-over 
effects were further minimized by presenting the stimuli 
in two different orders. Some subjects received the 
description of the program in one order. Other received 
the program description written in the reverse of that 
order. 

2.2.1 Methodology: 

The details of the experiment and its design are 
presented below, 

A. Stimuli: 

Stimuli in the experiment were the information about 
the program size, its control structure, data structure and 
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execution structure Complexities as described below under 

the design section. These four information constituted the 

four factors of the experinien t. Each of the above four 

factors had three levels. In the case of program size, the 

levels were small , moderate and big . Eor control structure 

and data structure complexities, the levels were simple , 

moderately complex , and very compl ex . The last factor, 

execution structure complexity, had low , moderate , and high 

as the three levels. Since there were four factors, each 

/ 4 ^ 

having three levels, a total number of 81(3 = SI j stimulus 

combination were generated. These program description will 
be referred as four factor design. Each combination of the 
stimuli was typed on 4" x 6” index card. The different 
factors were listed in vertical order on the card. 

There were 54 two factor program description also. 
These program des criptions con tained information about only 
two of the four factors. Since these stimuli set had 
information about only two factors, it will be referred as 
two factor design. There were thus six types t^^/o factor 
program descriptions. 

To control the probable effect of presentation of 
factors the above mentioned 31 four factor and 54 two factor 
program description were typed in reversed order also. Two 
typical stimulus card from each set are shown below. 
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Original Order 

Program Size = Small 

Control Structure = Moderately Complex 

Data Structure = Simple 

Executional Complex* = Moderate 

Reversed Order 

Executional Complex = Moderate 
Data Structure = Simple 

Control Structure =» Moderately complex 

Program Size = Small. 

Along with these 1 35 » (81+54) , program descriptions, 
two end anchors were also presented. Practice set of stirauli 
contained two end anchors and eight other stimulus oomhination 
selected randomly from the 81. These end anchors and practice 
examples are given in Appendix A. 

B. Subjects: 

There were 12 subjects, tjine of them have been 
working as System Programmers at Computer Center of Indian 
Institute of Technology,iianpur. Their experience in 
programming ranged from 2 to 10 years. They had a thorough 
experience with PORTRAF and Assembly language and good 
familiarity with COBOL, Autocoder, etc. Two of the subjects 
were senior graduate students in the Computer Science Program, 
working for their doctoral degree. They had approximately 
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3 years of experience in PORTHAW programming. They also knew 
OOBOL and Assembly language. One subject was a senior faculty 
member in the Computer Science Program. He has been teaching 
programming courses at undergraduate and graduate levels for 
approximately six years. All these subjects had experience 
of repeatedly trying to understand others programs to help 
them in debugging and maintenance. This experience was gained 
by working as consultants, programming laboratory incharge or 
instructor. Their long experience of understanding and 
maintaining others programs made them a good choice as 
subjects for this e >cperiment.i 

G. Procedure: 

The experiment was conducted one subject at a time 
in a small room. On arrival, the experimenter explained 
verbally the purpose of the experiment. The subjects also 
received a typed sheet of instruction which described the 
nature of the experimental task and his role as subject. The 
instructions are reproduced in Appendix B. The subject was 
urged to understand the task clearly and to seek clarification 
in case he had any doubt. Each subject worked with ten prac- 
tice examples. The subject was instructed to read the 
information about the four factors of hypothetical program, 
to form an opinion about its overall ecomplexity and to rate 
its complexity on a 51 -point scale. After completion of 
practice examples, the subject received 137, (81+54+2) cards 
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containing the description of hypothetical programs. Cards 
were shuffled thoroughly. The suboect read each card and 
rated its complexity. After rating all cards, a break of 
20 minutes was given. The entire procedure was repeated 
(excep'; the practice texample) by the subject after the 
break. As pointed out earlier, there were two sets of 
cards containing the stimulus combination in the normal and 
reversed order. Half of the subjects worked on one set of 
the cards; the other half worked on the second set of the 
cards. 

H, Design: 

The experiment had two sets of stimulus design. One 
set had 3x3x3x3, program size x control structure x data 
structure x execution structure complexities, factorial 
design. This desig-' we have referred as 4-factor design. 
Another set of design had six 3x3, program size x control 
structure, program size x data structure, program size x 
execution structure, contrQlstructure x data structure, 
control structure x execution structure, data structure x 
execution structure complexities, factorial design. This 
design has been referred as 2-factor design. 

2.3 RESULTS AND DISCUSSION 
A. Graphical Analysis: 

A simple and useful test of the integration model 
can be made by plotting the raw data. The mean program 
complexity was therefore plotted as function of program 
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size X control structure, program size x data structure, 
program size x execution structure, control structure x data 
structure, control structure x execution structure, data 
structure x execution structure complexities. In Eigures2.2 
and 2.3 the column levels are spaced according to their 
functional values [Anderson, 76]. These six sets of 2-'way 
graphs of Figures 2.2 and 2.3 are from 2- and 4-factor 
design, respectively. 

In general, the curves of Figure 2.3 are smoother 
than those of Figure 2.2. This is not surprising. The means 
used in Figure 2.3 were obtained by pooling a larger number 
of observations than those used in Figure 2.2. 

The six sets of curves in Figure 2.2 do not exhibit 
a common trend, ilone of the curves show clear parallelism. 

It is therefore difficult to hypothesize any one of the 
integration models by oust looking at the nature of the 
curves. If we hypothesize an additive integration rule, 
the deviations from parallelism have to be treated as minor non' 
linearity in response usage or idiosyncra tic oudgements 
of a few stimulus combinations. In fact, the deviation 
from parallelism in each set appears to be attributable to 
only one point of the set. On the other hand, if we decide 
to choose the multiplying model then these deviations from 
parallelism have to b e treated as genuine. To arrive at 
any convincing interpretation. Figure 2.3 should be examined. 
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The four sets of curves, program size x execution 
structure, control structure x execution structure, data 
structure x execution structure complexities, of figure 2,3 
clearly exhibit parallelism. The curves of program size x 
control structure have a diverging trend whereas the curves 
of control structure x data structure have 'a converging trend. 
In these two curves, also, the non parallel ism centers around 
one point only. Over the two desig'^s, only curves of program 
size X control structure show a consistent pattern. Again 
it is difficult to choose between an additive and multiplying 
model from these graphic analysis. 

B. Statistical Analysis: 

To supplement ■ the qualitative evidence from the 
graphical analysis, an overall analysis of variance was 
performed on the data of 4-factor design. In this 12x2x3x3x3x3 
analysis of variance, the factors were subjects, replication, 
program size, control structure, data structure and execution 
structure complexities. For 2-factor design, six separate 
12x2x3x3 analysis of variance were performed. In these 
analysis, the factors were subjects, replication and two of 
the four factors' of 4-factor design. As each subject rated 
all the stimuli, interaction of the subject with a source 
of variance was used as error term in the construction of 
F ratios [Winer, 71, p.202 ]. 
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P ratios for all 2-\i3.y interaction for loth the designs 
are presented in fahle 2.1. The interaction terms -were signi- 
ficant in program size x execution structure, P(4,44) = 2.67, 
£< 0.05, control structure x data structure, P(4,44) = 2.69, 
£< 0.05, and control structure x execution structure, 

P(4, 44) = 3.S4, J2 < 0.05, in 2-factor designs. In 4-factor 
design, the significant interactions were program size x 
control structure, P(4, 44) = 6.56, £< 0.05, and the control 
structure x data structure, P(4, 44) = 4. 23, £< 0.05. Ihe 
control structure x data structure interaction seems to be 
reliable. It is present in both 2- and 4-factor designs. 
According to an additive model, the interaction terms in 
the analysis of variance should all be nonsignificant. The 
two significant interactions mentioned above thus cast 
doubt on the applicability of an additive model. 

To have a more powerful argument either for additive 
type model or multiplying model, it is appropriate to test 
the goodness of the fit of these two models to the data. The 
mean absolute deTiations between the predierted and observed 
means was used as criterion for comparing the two models. 

The means from both 2- and 4-factor designs were prepared 
in a Row x Oolumn table for all the six sets of 2-way 
interactions. The entry of each cell was observed mean, 

OV. To obtain the theoretical value, TO, for each cell, 
in the case of additive model, the marginal means for each 
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TABLE 2.1 

F RATIOS FOR 2-¥AY INTERAOTIORS 


S. ITo. 

Interacting Factors 

2-Factor 

Design 

4-Factor 

Design 

1 

Program size x Control 

sti*ucture 

1 .51 

6.36* 

2 

Program size x Data 

structure 

1 .80 

0.22 

3 

Program size x Execution 

struotur e 

2.67* 

0.44 

4 

Control structure x 

Data structure 

2.69* 

4.28* 

5 

Oontrol structure x 
Execution structure 

3.84* 

2.38 

6 

Data structure x 

Execution structure 

1.35 

1 .31 


* indicates statistically, significant 
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row and column and the grand mean were computed. Let 

^ ^ and G denote ith row marginal mean, jth column 
marginal mean and grand mean. For any cell, ij, the 
theoretical value, TO. is 

TO . = 5 + 0 G (2.4) 

xj X* •J •• 

The mean absolute deviation was computed as follows: 

E = I I 1 (07 - T0)| /n (2.5) 

i d 

In the present case n is equal to 9. 

In the case of multiplying model, TO for each cell 
was estimated according to equation given below: 

T0..=H. x^./(? (2.6) 

Xj X* •J ♦♦ 

and the mean absolute deviation was again calculated 
according to Equation (2.5). 

The mean absolute deviation from both the models 
are shown in Table 2.2. From entries in the table, it is 
evident that mean absolute deviations are less under 
additive model as compared to under multiplying one. This 
trend is true in all but one case. In other words, the 
additive model gave a better fit than the multiplying one. 

To prove the superiority of additive over multiplying 
model, the absolute mean deviation from the predicted values 
of the two models were compared. The sixth column of Table 2.2 
lists the t ratios for the difference between the two models. 
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TABLE 2.2 

MEAIT ABSOLUTE DEVIATIONS 


Type of 
Design 

S.No. 

Row Pact or x Oolumn 
Eactor 

Additive 
Rul e 

Multiplying 

Rule 

t 

Ratio 


1 

Program size x 

Control structure 

0,463 

0.784 

1 .59 


2 

Program size x 

Data stru cture 

0.495 

0.85 

1.94* 

2-Eactor 

3 

Program size x 
Execution structure 

0.604 

0.704 

0.2 


4 

Control structure x 
Data structure 

0.653 

1.10 

3 . 05 * 


5 

Control structure x 
Execution structure 

0.610 

0.6 

0.17 


6 

Data structure x 
Execution structure 

0.453 

0.603 

1.36 


1 

Program size x 

Control structure 

0.350 

0.266 

0.737 


2 

Program size x 

Data structure 

0.066 ' 

0.246 

2.36* 

4-Eactor 

3 

Program size x 
Execution structure 

O.OS 

0.12 

1.4 


.4 

Control structure x 
Data structure 

0.603 

0.910 

3.85* 


5 

Control structure x 
Execution structure 

0.146 

0.248 

5. 29* 


6 

Data structure x 
Execution structure 

0.126 

0.187 

2 . 904 * 


t 55 ( 3 ) = 1.86 (one tail^i) 


* indicates statistically significant 
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It can be seen that the additive model appears to be superior 
to multiplying model in two cases of 2-factor design and in 
four cases of 4-factor design. Prom the results at hand, it 
seems reasonable to claim that an additive model gives a 
better account of the data than does the alternative multi- 
plying model. 

G. Single Subject -Analysis; 

To supplement the results of the previous sections 
and explain the minor discrepancy from parallelism in the 
main analysis mentioned earlier, separate analysis was made 
at the level of each subject. Individual analysis, it should 
be noted, are important to check that the group averages are 
not hiding alternative integration strategies by different 
subjects [Singh, 79], [Shanteau, 69 ] . Table 2.3 presents 
F ratios for all 2-, 3- and 4-way interactions for each 
subject in the main 4 -factor design. 

Inspection of Table 2.3 fails to reveal any meaning- 
ful pattern. It shows that significant interactions are only 
a few in number and that they are scattered. The 3- and 4-way 
interations are statistically significant in case of only one 
or two subjects. These subjects also disclose inconsistent 
patterns in the significant interactions. It appears 
reasonable, therefore, to accept our earlier interpretation 
of additive rule for the judgement of program complexity. 



TABLE 2. 5 

F RATIOS PROM SINGLE SUBJECT ANALYSIS 
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D. Supplementary Analysis: 

Although the main purpose of the experiment was differ- 
ent, some of the results of this experiment have implications 
for information integration theory. 

If all the stimuli have the same weight, averaging 
and adding models make identical prediction. A simple adding 
rule implies that every addition of information pertaining to 
program complexity should increase the overall program comp- 
lexity* In contrast, an averaging model predicts a decrease 
in the overall program complexity when two pieces of informa- 
tion about the program are of unequal value. To plot all the 
six sets of 2-way curves, mean program complexity were obtained 
from 2- and 4-factor designs. In the case of 2-factor design, 
each mean is based on only two factors. But in the 4-factor 
design, each mean is based on the two factors as well as the 
average of the two other factors of the design. Since the 
mean program complexity in the case of 4-faotor design is 
obtained by pooling over more number of factors compared to 
2-facto design, the former should be larger than the latter 
in case adding rule is operative. But in averaging operation, 
the former should be lower than the latter. 

To diagonise the real model, it is only necessary 
to plot the curves from 2-and 4-factor designs. If- averaging 
rule is operative, then at least one of the curve of 2-factor 
design should cross over at least one of the curves of 
4-factor design. This crossover is indicative of averaging 
rule [ Anderson, 65] * 
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Six sets of 2-way curves for mean program complexity 
were plotted in 5’igure 2,4. The column levels are equally 
spaced. The dotted curves represent data from 4-factor 
design, whereas the solid curves represent from 2-factor 
design. 

Prom the curves, it is evident that the rule of 
integration is averaging type. In all sets of the curves, 
solid lines cross-over the dotted lines. The result thus 
confirms the averaging rule and infirm the alternative 
adding rule. 

This success of the averaging model is not surprising 
in view of the mass of evidence that supports the averaging 
hypothesis in many different domains [ Anderson , 65], 

[Dalai, 78], [ Singh, 75,79]. It is, however, interesting 
to note that the present success extends the generality of 
the averaging model to a completely novel judgemental task 
and to a different subject population. Accordingly, it can 
be said that averaging rule is perhaps a general way of 
co-ordinating information. 

E. Conclusions: 

From graphical analysis, analysis of variance, and 
from test of goodness of fit, we conclude that additive type 
model explains most of the experimental data. Selection of 
additive integration model accomplishes two goals simultane- 
ously *. 
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(i) It indicates that response is on interval scale and 

(ii) It yields interval scales of the stimulus variables. 

The question posed in the beginning can be answered 
very easily on the basis of this validated additive integra- 
tion rule. It is easy: to prove that marginal means of the 
data table are the stimulus values on validated interval 
scale. The marginal m^ans along with S’ ratios of the 
four factors of the experiment are presented in Table 2.4.- 


TABLE 2.4 : MRG-IATA.L MEANS OP FACTORS 


S.No. 

Factors 

Level 1 

Level 2 

Level 3 

P(2,22) 

ratios 

1 

Program size 

14.05 

IS. 71 

• 

C\J 

38.75* 

2 

Control structure 

13.65 

13.73 

21 .71 

29.58* 

3 

Data structure 

15.32 

13.60 

20.18 

45.13* 

4 

Execution structure 

17.03 

13.22 

13.85 

26.60* 


* indicates statistically significant. 


As all the four P ratios were highly significant, it 
can be said that the four factors of size, control structure, 
data structure, and execution structure complexities contri- 
bute to overall program complexity in the opinion of the 
programmers. This result suggests that the program complexity 
is dependent upon all these factors and that each factor makes 
an independent contribution to program complexity. 
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If we look at the marginal means of each factor, we 
find that the first and third values differ markedly across 
the four factors. This implies that the different factors 
were felt to vary in importance. To prove it clearly, ratios 
of sum of squares for each factor were taken with the total 
variance [Moll, 77]. These ratios for program size, control 
structure, data structure and execution structure complexi- 
ties turned out to he 0.193, 0.235, 0.037 and 0.012 respec- 
tively. From these ratios we can say that the above four 
factors can he ordered in terms of their relative importance 
as: control structure, program size, data structure and 
execution structure complexities. 



CHAPTER 3 


EXPERIMMTS OH PROGRAM COMPREHEHSIOH 
WITH NOVICE PROGRAMMERS 


The main finding of the experiment described in 
Chapter 2 is that the four factors, viz., program size, 
control structure, data structure, and execution structure 
complexities contribute independently to the perception of 
overall program complexity. This result defines a boundary 
for further experiments. Since the results of the previous 
chapter were based on judgemental data, it appeared proper 
to study the effect of these four factors on actual program 
comprehension . 

The organization of this chapter is as follows. The 
first section discusses different measures of program under 
standing, the second section contains discussion about the 
suitability of different program complexity matrices, and 
the last three sections include the details of the three 
experiments, analysis of their data and the results. 

3.1 MEASUREMENT TEEIITIQUES 

The program comprehension can be measured by 
different techniques. Weissman, [ Weissman, 73], used 
three measures of program understanding — hand-simulation 
of the program, filling in the blanks in a paragraph 
describing the program and a subjective rating of program 
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understanding. Shneiderman, [ Shneiderman , 76], had discussed 
other measurement techniques. The program modification task 
may take a long time and can usually he accomplished with 
only Comprehension of a section of the program. Similarly, 
location of a hug, response to a set of questions about the 
values of variables, program’s output, etc., can he done 
only with partial understanding of the program. Shneiderman, 
[Shneiderman, 76,77], and Sheppard et al., [Sheppard, 73], 
used ability to memorize the program as a measure of program 
comprehension. Memorization of a complex program can only 
be achieved by abstracting meaningful structures at program 
level or at a higher level of problem domain. 

Success at memorization implies that the subject has 
understood the low level details of each statement, inter- 
mediate level grouping and the overall function of the 
program. Subjects have to apply their knowledge of syntax 
of programming language in reconstructing the program. The 
reconstructed program reflects the success of the subjects 
in conveying the semantics of the program through syntax of 
the programming language .0 ommuni eating through the rigid 
syntax of the programming language may demand aro extra effort 
on part of the subject. This will be certainly true in the 
case of beginners, if not for the experienced programmers. 

Other limitation of the technique is due to the fact 
that, there is no reasonable basis to decide about the time 
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to be allotted to the subjects for understanding the program. 
The learning theory within the framework of network models 
assumes that lower order cogits of an information structure 
are acquired, strengthened and associated during learning 
[Roth, 77]. The process of establishing, strengthening and 
associating repr esen tat ion and then uritizing configurations 
of associated representations are applied recursively as 
learning progresses. Both structural representation and the 
processing of knowledge change qualitatively as learning 
progress [ Roth , 77]. 

The recall and recognition of partial and complete 
knowledge structure should vary according to current state 
of the memory representation of the complete structure. 
Horowitz et al.,L Horowitz, 72], ha^ investigated this 
problem. The suitability of the recall as a measure of 
program comprehension is doubtful because of its dependence 
on time allotted for learning. Recall would prove to be a 
good measure if sufficient time has been given for uniti- 
zation of the mental representation [Roth, 77]. 

In the case of small programs (less than 100 lines) 
it would be difficult to differentiate between rote memori- 
zation and understanding from the reconstructed programs. 

Accordingly, we have decided to use both description 
of program in natural language and program reconstruction 
as measure of program und erstanding. This would also allow a 
comparison of the two measures. 
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3.2 OOMPLEXITY METRICS 

A "brief survey of various complexity metrics along 
with their limitations has been given in Chapter 1. In this 
section, the measiires used for selection of programs for 
experiments are discussed. 

A. Program Size: 

The number of executable object instructions have 
been used as a measure of program size [ Schick, 7S] , The 
number of object instructions is machin ©-dependent and 
subjects, in general, do not use machine-dependent features 
to understand the source program. Further, fiiis unit of 
measure as well as Halstead’s measure, [Halstead, 75], of 
program size in terms of operators and operands are at 
low level compared to unit used in cognitive process. Since 
we do not have any qualitative or quantitive basis to 
hypothesize about cognitive unit and their formation process 
in the context of program comprehension, we are unable to 
formalise a better measure. 

It has been shown that the program size in number 
of source statements is strongly correlated with the 
occurrences of actual software errors [Thayer, 75]. In 
addition, a statement of source program is a unit at 
higher level than operator and operands. It appeared 
reasonable to assume a source statement as cognitive unit. 
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Consequently, we decided to use number of source statements 
as a measure of program size. 

In the experiment, factor of program size had two 
levels: small and big . The small programs contained 25-30 
statements whereas the big programs contained 45-70 state- 
ments*. 

B. Control Structure Complexity: 

There are different metrics for control structure 
complexity [Ronback, 75]., [ McCabe, 76], [Woodward, 79],. 

These measures as pointed out earlier, do not reflect the 
overall psychological complexity of the control structure. 
The measures are based on the numerosity of certain features 
of the control structure such as number of basic paths, 
number of 'knots’, number of levels, etc. The numerosity 
affects the interpretation process but not the learning 
process [Lofgren, 77,73], The learning is affected by 
pattern or regularity in the object to be learned, Wone 
of these measures capture this aspect of program comprehen- 
sion. So in our opinion all the above measures are of same 
class, may be with some quantitative difference in their 


* 


We are concious of the fact that the result obtained 
with above classification of size may not generalize 
to larger programs. However, these results will 
certainly be applicable at subroutine or module levels 
which form the basis of all reliable sofjtyag'e. pr|(^^'^j:jj|:s. 


L 63059 
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suitaljility . Further, there is no q-uan titative evidence 
about correlation between cognitive load and these metrics. 

Since we did not wish to add one more metric on the 
basis of subjective arguments, we decided to use the count 
of control statements such as IF' s and DO’ s in a program, 
as proposed by G-ilb, [ Gilb , 77], as a measure of control 
structure complexity for selection of programs for the 
experiments, Gilb has proposed this measure on the basis 
of results of [ Amster, 76]. 

C, Data Structure: 

To the best of our knowledge, we do not know of any 
metric proposed for data structure. This is partially due 
to late realisation of the importance of data structures in ■ 
program development and understanding. The data structure 
plays an important role in building the mental representation 
of the program and in learning. 

The structural property of data is due to relation 
between their elements. The relation between data elements 
may be formally abstracted in terms of predicates and axioms. 
But defining the complexity in terms of number of predicates 
does not appear very sound, because the semantic complexity 
of each predicate or axiom is not same. Further, it will be 
even more difficult to define the complexity of user-defined 
data structures. The user may choose arbitrary number of 
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axioms or predicates to define data structures. In addition, 
the predicates or axioms are not the cognitive units to 

t 

understand data structures. 

The other alternative is to define complexity in terms 
of some property of storage structure or access mechanism. 
However, this leads to a machine-dependent measure. 

In cognitive process, names play an important role. 

The structure of mental r epresentation is expected to he 
defined in terms of these names or objects of problem 
domain which they represent. So it seemed proper to define 
data structure complexity in terms of number of variable 
names needed to define a data structure. For example, in 
the case of array A[i] two variable names are used. 

The language used in experiment was FORTRAH which 
has very limited data structure constructs. The criterion 
used for selection of programs for the experiments was that 
each variable contribute one unit and each vector and matrix 
contribute 2 and 3 units to data structure complexity. 

D. Execution Structure Complexity: 

The execution structure of the program may be 
characterized by crxacution of sequence of assignment 
statements in the program. The regularity of the execu- 
tion of sequence of assignment statement is represented 
by control structure of the program. The only aspect 
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left out is numerosity of the assignments in the program 
and the execution structure of the assignments. Since the 
execution structure of assignment statements is machine- 
dependent feature and is not expected to play any major role 
in cognition process, the complexity measure should reflect 
the two aspects: the number of assignment statements and the 
number of operands needed for each assignment. There was 
no quantitative evidence to choose a measure, ¥e decided 
to use a nominal scale on the basis of number of assignment 
statements in the program for selection of programs for the 
experiments. A preliminary experiment was also conducted 
to investigate factors contributing to execution complexity 
of assignment statements. 

5.3 EXPERIMM 1 : EXPRESSION EVALUATION 

The main purpose of the present experiment was to 
study the effects of number of operands in an (EORTRAN) 
assignment statement and its meaningfulness in some problem 
domain on its executional complexity. ^ source statement 
was selected as unit for investigation because it is expected 
that this is the unit used in cognitive process. 

A. Subject: 

One hundred and five students enrolled in an 
introducrtory programming course (TA-306 ) at the Indian 
Institute of Technology, Kanpur, served as subjects. 
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These students had started writing short (less than 100 lines) 
programs as their laboratory and class assignments. The 
language tnught: in the course was PORTRAIT. The experiment was 
conducted as a 20-minute quiz In the middle of the semester. 

The weighted quiz marks formed part of their final grade. 

' . i ' 

B. Quiz Paper: 

The quiz paper was single page cyclostyled sheet, 
containing the instruction to the student and ten PORTRAIT 
assignment statements. The students were asked to find 
values of variables after execution of assignment statements. 
They were not permitted to use calculators. They could write 
the answer in simplified fraction. They were also asked to 
mark any error found and calculate the value after correction. 
Errors of mixed mode were introduced purposely to pursuade the 
students to read and understand each statement carefully. 

There were ten assignment statements in the quiz. The 
first five statements formed the group of meaningful statements 
and the rest five the group of meaningless statements. The 
meaningfulness of the statements were due to their direct 
relevance to some problem domain and partially due to selec- 
tion of variable names which were suggestive of their meaning 
and purpose. Pirst statement was a formula from mechanics; 
the second was unit conversion formula from Qentigrade to 
Parenhiet and so on. The meaninglessness was due to the 
absence of relevance of statements to any problem domain. The 
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names of variables in this group were not natural, that is, 
they did not suggest any meaning. 

The ten statements were classified in three groups on 
the basis of number of operands in them. The quiz paper is 
reproduced in Appendix 0. 

C. Procedure: 

The quiz was conducted in a big class on a scheduled 
time. The instructor explained the instruction verbally 
to the students. They were requested to seek any clarifica- 
tion about the quiz, -^i-fter this initial rapport, the quiz 
paper was distributed. The students were instructed to 
write their name and roll number on the sheet. They were 
reminded of the allotted time for the quiz. After 20 minutes, 
papers were collected. 

The quiz was evaluated on a 10-point scale. The 
grader had extensive experience in grading student papers. 

The answers written in wrong mode were graded incorrect. 

P, Results: 

An analysis of variance was performed on the data 
C Winer, 71, p. 105]. The difference between mean scores 
for meaningful - and meaningless - group was significant, 

P(1, 104) =4.826, < 0,05. The mean score for the meaning- 

ful - group (M = 3.18) was higher than the mean score for 
the meaningless - group (M = 2.88). This suggests that 
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evaluation of meaningful expression vas easier than that of 
meaningless ones. 

Similar analysis of variance was performed on data 
to investigate the effect of number of operands on evalua- 
tion of expressions. There were three groups of expression 
on the basis of number of operands. The difference between 
the mean score for the first group (number of operands = 4) 
and the second group (number of operands = 5) was statisti- 
cally significant, T(1 , 208) = 57-29, 2<0.05. Similarly, 
the difference between mean score for the first group and 
the third group (number of operands =6) was also statisti- 
cally significant, F(1 , 208) = 34.996, 2 0.05. However, 

the difference between mean scores of the second and third 
group was not significant, ^(1 , 208) = 2.733. The mean score 
for the three groups were 7.76, 5.38, 5.90, respectively. 

These results suggest that the expression evaluation 
becomes increasingly difficult as the number of operands 
increase. This result confirms our general expectation 
about complexity of expression evaluation. However, it is 
important to note that this effect is significant even in 
the case of statements having a few operands as 4,5 and 6. 

3.4 EXPERIMENT 2 ; DATA STRUCTURE OOMPEEXITY 

This experiment was conducted to study the effect 
of data structures and meaningfulness of the program on 
program understanding. Data structures can be expected to 
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affect the complexity of the programs in tvjo ways. Pirst, 
the numerosity and syntactic complexity of data would surely 
increase the complexity of the program. Por example, a program 
having ten variables is more complex than one having just two 
variables. Similarly, a program ^vith many 3-dimensional arrays 
is more complex than one with few vectors only. Secondly, 
complexity is also affected by whether the data names and 
organization chosen are "natural" to the problem domain. 

The selection of "natural" names and organization of data 
will facilitate understanding due to referential meaning of 
data names in the context of some problem domain, -^n example 
of "natural organization" is representation of a chess board 
as matrix. This second aspect is responsible for meaningful- 
ness of the program. 

A. Subjects: 

One hundred and five students of an introductory 
programming course (TA-306) were selected as subjects. They 
were randomly divided into five groups, each consisting of 
21 subjects. As in Experiment 1, the language used in the 
course was PORTRAIT. The experiment was conducted as 1-hour 
quiz, 15 days before the end semester examination. Their 
scores in the quiz were added to the final grade. 

B. Quiz Paper: 

The quiz paper contnined one of the five programs. 

The first program did not have any matrix or vector. It 
has 14 simple variables. The variable names were selected 
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in such a way that they did not carry any positive or 
negative indication of their relevance to any problem domain. 
All names were framed by randomly choosing two characters 
from a set of alphabets. The assignment statements and 
decision structures were also meaningless. That is, the 
assignment statements were constiructed by randomly choosing 
variables and operators, and they did not have any relevance 
to a problem domain. The decision structures were framed in 
a similar manner. This program was thus a collection of 
valid P0RTRA.1T statements without any analog7 with a problem 
domain. 

The second program was a purposeful program. The 
program computed average percentage of seats filled up in 
each section of a course and it also computed average number 
of section per course in a department. The program did not 
have any vector or matrix. It had 15 simple variables. The 
chosen variable names were neutral. They did not provide 
any hint of their relation to a problem domain, 
variables had two characters selected randomly from the 
set of alphabets. 

The third program was same as the second except the 
variable names. In this program, the variable names were 
of six characters and they were selected in such a way that 
names suggested their meanings and association with probiLem 
domain. T'or example, MROLD represented the number of 
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students enrolled in a section of a course and MA TT M 
represented the maximum capacity of the section. 

The fourth program did the analysis of student’s 
bio-data^ After validating some of the input fields, it 
calculated the number of students, the number of males 
and females, and their percentage, average age of students, 
etc. Finally, it p Tinted out complete statistics. The 
program had eleven simple variables and five arrays. The 
variables and arrays names were of two characters and they 
were neutral as in the first and second programs. 

The fifth program was another version of the fourth 
program with only difference in variable names. All variables 
and vector names were suggestive of their meaning and associ- 
ation with the problem domain. 

In all the programs mentioned above, the numerosity 
of data structures and meaningfulness of the program were 
varied. Meaningfulness was due to variable names and 
familiarity of the subject with problems. The first program 
served as control for meaningfulness of the program. The 
fifth program was expected to be most meaningful compared 
to all other programs. 

Each program has one line comment in the beginning 
to set a frame of reference for the subject. 
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0 . Procedure; 

The quiz was conducted in a big examination hall. The 
instructor described the purpose of the quiz and time slots 
allotted for different task. Oyclostyled sheets of instruc-^- 
tion were distributed to them. Students were encouraged to 
seek clarification in case of any doubt. After this prepara- 
tion, programs were distributed to them. They were given 
25 minutes for reading and understanding the program. Blank 
portion of the program sheet was allowed to be used as 
scratch-pad for preliminary calculations. Programs were 
collected after completion of the allotted time. 

In next 10 minutes, they were required to write the 
description of the program in English. The description 
would include what program does and how it does. In the 
last 10 minutes, they were asked to reconstruct the 
program. It was not essential to use same variable names, 
statement numbers and formats in the reconstructed program 
as long as logic of the program remained similar. 

They were also required to mention the error found 
in the program. Some of the errors were intentional and 
some of them were undetected at the time of testing. It 
was expected that location of bugs in the program would 
facilitate the comprehension. The presence of error was 
indicated in the instruction sheet. 
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The quiz was evaluated by an experienced grader. 

Both the program description and the reconstructed program 
were graded on a 10-point scale. The reconstructed programs 
were graded subjectively on the basis of logical consistency 
of the statements reconstructed. The instruction and 
programs are reproduced in Appendix D and E respectively. 

D. Results: 

Separate analysis was performed on scores of program 
description and reconstructed programs. E ratios for the 
two measures were E(4,100) = 7*35 and 11.172, _£ < 0.05 
respectively. Since the results were statistically signi- 
ficant, the hypothesis of equal mean score for all the five 
programs was contradicted. 

To study the effects of numerosity of data structure 
in program and meaningfulness of programs on program under- 
standing, all possible comparisons between total scores of 
each program from program descriptions were made. Total 
scores of all the programs for both measures (program 
description and reconstruction) are given in Table 3.1. 

E ratios for all comparisons of total scores for program 

description are inclded in Table 3.2 along with sum of 

A 

squares and 

TABLE 3.1: SUM OE S SORES OE PROGRAM LES7RIPTI0M kKD 
REOOMSTRUOTIOM 




A 

B 

0 

D 

E 

Program 

Description 

10S 

33 

71 

75 

122 

Program 

Reconstruc tion 

150 

56 

69 

67 

101 



TABLE 3.2 


E RATIOS OE OOMP ARISONS - PROGRAM LES'ORIPTION 


S.No,. 

Comparison 
"between programs 

SSq 

Error 

E 

Ratio 

1 

A - B 

133.928 

7.35 

18.22* 

2 

A - 0 

32.595 

Tf 

4.434* 

5 

A - D 

25-928 

11 

3.527 

4 

A - E 

4.66 

11 

0.634 

5 

B - G 

34.38 

11 

■ 4.67* 

6 

B L 

42.0 

11 

5.714* 

7 

B - E 

188.59 

IT 

25.65* 

8 

0 - D 

0.381 

11 

0.052 

9 

0 - E 

61 .928 

11 

8.426* 

10 

D - E 

52.595 

11 

7.155* 


* indicates statistically significant 
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Differences in total scores for program description 
"between programs A and D, A and E, and 0 and D were not 
statistically sigTiif icant . Program D and ® had more complex 
data structures compared to program A, B and C. This suggests 
that the numerosity of data structures did not hinder the 
program understanding. The total scores of program D and E 
were more than those of B and 0. This further justifies the 
ineffectiveness of the numerosity of data structures. The 
score of program A was more than all the programs except E, 
This result as well as the ineffectiveness of numerosity of 
data structures are contrary to our intuitive expectation. 

The scores of programCand E were more than those 
of B and D respectively. Differences of scores between 
programs B and 0, D and E, 0 and E, and B and D were statis- 
tically significant. These significant differences suggest 

^2jcxS ^ 

that meaningful programs were efi t airc to understand. The 
meaningful data names and organization facilitate understand- 
ing. However, we do not have reasonable argument for higher 
scores of program A compared to programs B,C! and D and so it 
does not seem proper to generalize the above result. 

Similar comparisons were made between total scores 
of each reconstructed program. Table 5.5 lists P ratios for 
all comparisons. The total scores of programs B and 0 were 
less Compared to the scores of programs D and E. The 
difference between scores of programs S and D was not 
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TABLE 3.3 


E RATIOS OE COMPARISONS - PROGRAM RECONSTRUCTION 


S.No. 

Comparison 
between 
pro grams 

SSo 

MSt, 

Error 

P 

Ratio 

1 

A - B 

210.381 

6.22 

33.823* 

2 

A - C 

156.214 

fT 

25.115* 

3 

A - L 

164,02 

?f 

26.370* 

4 

A ~ E 

57.166 

f? 

9.190* 

5 

B - C 

4 .024 

?t 

0.647 

6 

B ^ I) 

2.88 

?? 

0.463 

7 

B - E 

48.214 

f1 

7.751* 

8 

C - B 

0.095 

!T 

0.015 

9 

C - E 

24.38 

t! 

3.92 

10 

D - E 

27.52 

ff 

4 .425* 


* indicates statistically significant 
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significant. These results supplement the conclusion from 
the other measure that complex data structure did not hinder 
the program understanding. 

tt 

The differences between the total scores of programs 
B and 0 was not significant, whereas between D and E was 
significant. The latter result supports the conclusion 
that meaningful data names enhance, understanding whereas 
the former one contradicts it. In addition, the score of 
program A was more than score of all the remaining programs. 
Since the conclusions are contradicting, it will not be 
proper to generalize the result. 

To Compare the two measures of program understanding 
used in this experiment, a separate analysis of variance was 
performed on data. The result was not significant, F( 1,104) 
= 2. S3 and so it may be concluded that both the measures 
are interchangeable. 

A two-way, 5x2, analysis of variance was performed 
[Winer, 71, p.302], to supplement the above analysis and to 
study the interaction between programs and t^vo measures. The 
two factors were program types and measurement techniq.ues, 
with repeated measure on the second factor. 

The P ratios for factor of program type was signifi- 
cant P(4,100) = 9.19, 2 < 0.05. This supports the findings 
of earlier analysis. It is interesting to note that factor 
of. measurement techniiues did produce marginally significant 
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difference, PCI, 100) = 3.90, £< 0.06. It implies that the 
two measures of program understanding are not the same. Ihis 
is in contradiction with results of previous analysis. 

The. interaction between the two factors was signifi- 
cant, I'(4, 100) = 10, S, 2< 0.05. figure 3.1 shows the 
plots of mean scores of different programs as a function 
of different measurement techniques. It is evident from 
figure 3.1 that program description has a higher scores 
compared to program reconstruction for meaningful programs. 
The differences in the scores increase as the meaningfiil- 
ness increases with one exception. In the case of program 
A, the reconstruction score is higher than the description 
score. This may he due to difficulty in learning the 
meaningless computation. The subjects could not discover 
any structure and analogy with a problem domain in the 
process of learning. They found it difficult to describe 
the program briefly in English. In reconstruction, they 
probably used rote memorization and they did better. 

In summary, the numerosity of data structure in 
the program did not hinder understanding whereas the meaning- 
ful data names and familiarity of problem facilitated under- 
standing. The effectiveness of the program reconstruction 
and description measures are function of nature of the 


programs . 



MEAN SCORE 



O«scripilon 

R«construction 


FIG.3.1 COMR^RISON OF TWO MEASURES OF UNDERSTANDING 
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Since, the experiment did not prove the difference 
"between two measures conclusively, we decided to use program 
reconstruction as measure. ¥e also decided to use only 
uniformally meaningful programs for further experiments. 

3.5 EXPERIMM 3 : OOATTROI STRUCTURE AUD EXECUTION" STRUCTURE 
In the earlier two experiments, only one factor was 
considered for study at a time. In Experiment 3, our main 
aim was to study the effect of both control structure and 
execution structure on program understanding. 

A. Design: 

A 3x3, control structure x execution structure, factor- 
ial design was employed. Each factor had three levels — low , 
moderately complex, and very complex . Nine programs (3 =9) 
were constructed corresponding to each com"bination of levels 
of "both factors. The levels of control structure, as pointed 
out earlier, were controlled by number of control statements 
such as IE's, DO's, and computed GO TO's in the program. The 
number of IE statements were 0 - 2 , 3-5, and 6-8 corresponding 
to low , moderately complex, and very complex levels of this 
factor. Eor execution structure, the number of assignment 
statements and the number of operands in them were used as 
controlling factors. All the nine EORTRAN programs were 
purposeful and variable names were meaningful. Eactors of 

size and data structure complexity were held constant. 

\ 

These programs are listed in Appendix E. 
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B, Subjects: 

The subjects were one hundred and seventeen students 
from the same population as in Experiments 1 and 2, They 
were randomly divided in nine groups, each consisting of 
13 subjects. The experiment ws conducted as a 30-minute 
q.uiz a week prior to the end semester examination. 

0, Programs: 

"Mine programs used in the experiment are listed in 
Appendix E. All programs were of 15-20 lines in length. 

One of the program had an array; others had only simple 
variables. 

D. Procedure: 

The quiz was conducted in two big class rooms during 
one scheduled lecture hour. The instructor had discribed 
the purpose of the quiz in the previous lecture. The quiz 
was divided in two parts. In Part A, subjects were required 
to read and understand the program in 15 minutes. To faci- 
litate their understanding, they were asked to evaluate 
values of variables at different points in the program on 
the lines of Weissman, [^eissman, 73]. The first section 
of the part A paper contained the instruction for this 
part. The program formed the second section. The last 
section had the list of variables and line number of 
programs for which values were to be evaluated. The blank 
spaces were left for answers. After completion of Part A, 
the programs were collected. 
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In Part B, the subjects were required to reconsti*uct 
the program which they had read in Part A. The time allotted 
for this task was 15 minutes. It was not essential to use 
same variable names and statement numbers as long as logic 
of the program remained same. Separate instruction was 
given for this part. Both sets of instruotion are given 
in Appendix P. Students were also requested to point out any 
error in the program while reconstructing it. 

The reconstructed programs were graded on a lOO^point 
scale by two methods. In one method, the number of lines 
perfectly recalled was used as measure. In the other method, 
the reconstructed programs were graded subjectively. In this 
method of grading, logical consistency of the statements 
recalled was taken into consideration. Por example, if 
there was a statement, GO TO 30, in the middle of the 
original program and in reconstructed program subject:' had 
perfectly recalled the statement but had put it in wrong 
position then in the second method this was marked wrong. 

The method become subjective because logical consistency and 
related judgement about the extent of understanding will be 
done by the grader. 

E. Results: 

An. analysis of variance was performed on data obtained 
from both Ihe methods of grading- Effects of both control 
structure and execution structure complexities were 
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statistically significant, P(2, 108) = 9.18, 3.52, £ < 0.05. 
However, the interaction was not significant, P(4, 108) = 0.97. 
Similar result was obtained for the other method of grading 
(subjective). 

The absence of interaction between two factors confirms 
the judgemental result reported in Chapter 2, Mean scores for 
programs were plotted as function of control structure and 
execution structure. The two sets of curves in Figure 3.2 
are for the two methods of grading, respectively. It is 
evident from the curves that as the execution structure 
complexity increases the mean scores decreases for all levels 
of control structure. There is, however, one exception. This 
may be due to noise in data. However, it is interesting to 
note that mean score increases with the increase of control 
structure complexity. This is in contradiction with our 
intuitive expectation. 

This Unexpected result may be attributed to efficient 
strategy in the learning process. Two widely accepted meta- 
inferential techniques are inductive inference and inference 
by analogy [ Bob row ,75a » p.20]. Inductive inference uses a 
set of facts to form the basis for a general rule for 
expressing relations. In inference by analogy, if certain 
criteria of similarity are met between two situations, then 
a result that pertains to the first situation can be assumed 
to pertain to the second situation. We do not know which of 
the rules are used in the learning process. 



MEAN SCORES MEAN SCORES 
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A. SUBJEaiVE GRADING 


CONTROL STRUCTURE 



EXECUTION STRUCTURE 
B. GRADING BY LINE RECAlLEi:) 


FIG. 3.2 


CONTROL AND EXECUTION STRUCTURE 
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A closer observation of programs for moderately complex 
and very complex control structure ,hov/ever, reveals that there, 
was a syntactic/semantic regularity in the control logic of 
the program. Por example, the two statements, ^ 

IF(JB1T1 .EQ.1) JSIM = JSUM+1 and 
II’(JBI0)2.EQ.1) JSUM = JSTOi+2 

have a syntactic and semantic regularity. The difficulty in 
learning process is reduced due to this regularity because 
the subjects can easily use the two meta-inf erential techniques 
discussed above. This explains the reason for higher score for 
increasing control structure complexity. 

This result has vindicated our view on complexity 
metrics of control structure. The numerosity of control 
statements or of any feature of control flow will not reflect 
the psychological complexity of control structure. Metric 
has to take into account the effect of regularity on learning. 



CHAPTER 4 


EXPERIMENTS ON PROGRAM COMPREHSNSION 
WITH EXPERIENCED PROGRAMMERS 


The experiments described in Chapter 3 were single- 
factor ones. The third experiment in that chapter was a 
factorial experiment, but it considered only two factors. 
These experiments did not permit the study of interaction 
effects, Eurthermore, the subjects in nil those experiments 
were beginner programmers. The class-room environment and 
strict time schedule for course prevented us from designing 
multifactor experiments and using large programs in those 
experiments. Thus the results from the previous experiments 
are only partially complete. 

This chapter describes two more experiments. The 
first is a four factor experiments having program size, 
control structure, data structure, and execution structure 
complexities as factors. The second experiment studied the 
effectiveness of the subjective rating of the program 
complexities. In both these experiments, subjects were 
programmers with at least 2 years of experience. 

Eirst two sections of this chapter contain the 
details of the two experiments, analysis of data and their 
results. The last section includes the conclusions drawn 
from the experiments reported in Chapters 3 and 4. 
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4.1 EXPERIMEFT 1 : EOUR-EAOTOR OP" C-OMPEi^XITY 

The purpose of the present experiment was to study 
the effect of factors of size, control structure, data 
structure and execution structure complexities on program 
compreh ension . 

A. Design: 

A 2x2x2x2 factorial design was used. Each factor had 
two levels. The decision to have only two levels for each 
factor was taken due to time constraint with programmers. 
Programs were selected for each combination of levels of all 
the factors. Each subject worked on all the sixteen programs. 

B. Subjects: 

Eight system programmers served as subjects. The 
same group of programmers had volunteered as subjects in 
the experiment described in Chapter 2, The characterizing 
features of the group are repeated here for convenience. 

The experience of programmers rangad 'from 2 to 10 years. 

They had a thorough experience in PORTRAT'T and Assembly 
language programming. They also had a good knowledge of 
COBOL and other languages. 

0. Programs: 

Sixteen programs were selected from different text 
books of programming. The programs were modified to meet 
the criteria of selection. The guidelines for selection 
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of programs were metrics discussed in section 3.-2, Program 
of length 25-50 and 45-70 statements were selected as the 
two levels for the factor of size. Programs having 3-5 and 
3-10 logical IF or DO statements represented the two levels 
of the factory of control structure. ^To distinction was made 
between the above two control constructs for the sake of 
de'^iding levels. In the case of data structures, the number 
of vectors and matrices corresponding to two levels were 
0-3 and 5-7. Similarly, for execution structure complexity, 
5-6 assignment statements constituted the lower level. For 
higher level, the number of assignment statements were 
1 - 1 2 , 

Each program had two comments. The first comment was 
for subjects to fix up their frame of reference. The second 
comment included the specification and reference in the coded 
foim for the experimenter. The programs were tested for few 
sets of inputs for each. As the tests were not exhaustive, 
some errors remain undetected. Some other errors were 
introduced intentionally. It was thought that the presence 
of bugs will help to keep reader alert and thus aid under- 
standing. This is particularly important because the subjects 
worked as user consultants at the computer center for at least 
four hours a day where they read mostly programs with bugs. 

The sixteen programs were from different application 
area. These programs are reproduced in Appendix H. 
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D. Procedures: 

Phe experiment was conducted in four sessions, each 
of 2-hour duration. These four sessions were conducted on 
four different days. The experiment was conducted in a class 
room with all programmers simultaneously. 

The experimenter described the purpose of the experi- 
ment verbally to the subjects. Qyclostyled sheets of 
instruction were also d-is^tri but ed to -tlieflj. They were 
requested to seek clarification in case of any doubt. 

The instruction is reproduced in Appendix G. 

Each session was divided in four subsessions, each 
of 50-minute duration. In each subsession, programmers 
received one program selected randomly. Each programmer 
received the same program. There were two reasons for not 
randomizing the selection of programs between programmers. 
Eirst reason was that the programs were of different types. 

It was expected there would not be any learning effect 
from their order of presentation. The second reason was 
that the subjects could discuss each other program after 
the subsession. This discussion was not desirable for the 
purpose of experiment. 

The programmers were urged to read and understand 
the program. Programs were collected after completion of 
15 minutes. In the next 15 minutes, they wrote the comments 
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and reconstructed the programs. In their comments, they 
vifere asked to include a brief summary of the program, any 
error found in the program and their general opinion about 
the complexity of programs. A period of five minutes -was 
given to the above task. It was expected that the exercise 
in this five minutes would make the features of the program 
clearer to the subjects and the subjects would be able to 
organize their idea properly and would do better in program 
reconstruction. In the last ten minutes, subjects were 
required to reconstruct the program. It was not essential 
to use the same variable names, statement numbers and formats, 

A break of 15 minutes was given after t^vo subseggions. In 
this way, the experiment was conducted for four programs 
on a particular day. 

The reconstructed programs were graded on a 
10-point scale by an experienced grader. The grading was 
done subjectively. That is, the logical consistency of the 
recalled statements was the major criterion and the judgement 
about the extent of understanding was subjective. The comments, 
written by programmers, also helped in this grading. The few 
syntax errors that were found did not introdu ce any ambiguity 
as to the intention of the writer. These errors were totally 
ignored. 

E. Results: 

A Sx2x2x2x2 analysis of variance was performed 
[Winer, 71, p. 140]. The five factor being subjects, 
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program size, control struo'ture, data structure, and execution 
structure complexities. As each subject reconstructed all the 
programs, interaction of the subject with a source of variance 
was used as error term in constru -^tion of P ratios [Winer, 

71 , p.202] . 

The P ratios for all main and interaction effects 
are given in Table 4.1. All the factors except program size 
had statistically significant eff' ct on program reconstruction. 
Three 2-way interactions, program size x control structure, 
program size x data structure, and data structure x execution 
structure, were significant. The rest of 2-way interactions 
were not significant. Three 3-way interactions were 
significant. The non-signif ican t interaction was program 
size X control structure x execution structure. The presence 
of these significant interactions is in contradiction with 
the judgemental result of Chapter 2. 

Por graphical analysis, mean scores of reconstructed 
programs were plotted as function of 2- and 3-factors. The 
six sets of curves corresponding to all 2-factor combinations 
are given in Pigure 4.1. Similarly, curves for all 3-factors 
combination are given in Pigures 4.2 and 4.3. 

It is evident from the curves of Pigure 4.1 that 
when the program size is big the, mean score decreases with 
the increase of control structure, data structure, and 
execution structure complexities. Similarly, the increase 
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TABLE 4.1 

P RATIOS FOR POUR PAOTORS AWD IRTERAOTIONS 


S.Ro. 

Factors and Interactions 

P Ratio 

1 

Program size 

0.131 

2 

Control structure 

6.88* 

5 

Data structure 

162.00* 

4 

Execution structure 

GO 

00 

• 

o 

5 

Program size x Control structure 

6.763* 

6 

Program size x Data structure 

50.159* 

7 

Program size x Execution structure 

1.552 

S 

Control structure x Data structure 

3.706 

9 

Control structure x Execution structiure 

0.394 • 

10 

Data structure x Execution structure 

23.140* 

11 

Program size x Control structure x 

Data structure 

6.219* 

12 

Program size x Control structure x 

Execution structure 

0.137 

13 

Program size x Data structure x 

Execution structure 

9.084* 

14 

Control structure x Data structure x 

Execution structure 

9.529* 

15 

Program size x Control structure x 

Data structure x Execution structure 

4.991 


* indicates statistically significant. 
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PROGRAM SIZE 
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CONTROL STRUCTURE 



EXECUTION STRUCTURE, 



EXECUTION STRUCTURE 


KIS REPRESENTS MEAN SCORES IN PROGRAM RECONSTRUCTION FOR ALL CURVES 



EXECUTION STRUCTURE 
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LW 



CONTROL STRUCTURE 


HI 



CONTROL STRUCTURE 


PROGRAM 2IZE PROGRAM SIZE 




» REPRESENTS MEAN SCORE IN PROGRAM RECONSTRUCTION IN ALL CURVES 

FIG. 4.2 3«WAY INTERACTIONS OF FOUR FACTORS 



EXECUTION STRUCTURE 
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DATA STRUCTURE DATA STRUCTURE 


DATA STRUCTURE 



CONTROL STRUCTURE CONTROL STRUCTURE 

V-AXIS REPRESENTS MEAN SCORE IN PROGRAM RECONSTRUCTION FOR ALL CURVES 

FIG. 4.3 3-WAy INTERACTIONS OF FOUR FACTORS 
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of execution structure complexity decreases the mean score 
for all the levels of all the factors. However, in the case 
of program size small , the mean score increases with increase 
of control structure and data structure complexities. Further, 
for lower level of control structure complexity, the mean 
score increases with increase of data structure complexity. 

On the basis of results from the judgemental experiment, • 
the increase of any of the four factors would decrease the 
mean score, i.e., increase the overall program complexity. 
Furthermore, each factor would contribute independently to 
program complexity. That is, there would not be any inter- 
actions. The only set of curves satisfying the above 
expectation is for control structure x execution structure 
complexities. The trend of the other curves do not support 
the above expectation. 

The only conclusive result from the above curves is 
that execution structure complexity affects program under- 
standing and number of assignment statements and the number 
of operands in them are effective measure of this complexity. 
For other factors, due to contradicting evidences, no such 
con elusive .results can be inferred. 

A closer look on these curves reveals that the 
deviation from the expected reauJTt is at one point in each 
set of curves. This point corresponds to program size 
small and lower levels of control structure and data 


structure. Similarly, for other two sets of curves the 
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points Correspond to low levels of control structure and data 
structure and low level of data structure and high level of 
a^TSFC-.iStructrrsspective ly . The comments written hy programmers 
gave enough hint for a possible explan tion of this unexpected 
result. 

Six out of eight programmers had commented that the 
program number 2 (Appendix H, p.l69) was difficult to under- 
stand. Different vreasons were forwarded by the subjects 
for the above difficulty in understanding the program. Three 
OX them indicated that variable REQ causpd to them* 

They could not make out what this variable represented. The 
reasons forwarded by others were insufficient time, presence 
of errors in the program, and unfamiliarity with the problem. 
Similarly, the program number 6 (Appendix H, p.ly^.') was also 
described as difficult. The unanimous reason for the diffi- 
culty was switching of variable JW. They also reported 
difficulty in understanding the program number 10 and 16. 

In program number 10, it was due to nature of the algorithm; 
whereas for program 16, it was due to size of the program 
and unfamiliarity of some of the subjects with statistical 
formulae. 

The reasons forwarded for the difficulties in 
understanding suggest that these programs were more complex 
than they were originally assumed to be. Moreover, the 
increased complexity was not due to the four factors. of-’ corei- 
deration,but due to some external unexpected factors. Due 
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to this increased complexity of the programs the mean scores 
an unexpected deviation. The same reasoning also 
explains the anomalies in the curves of figures 4.2 and 4.3. 

In the light of the above explanation, the results 
can he summarized as the factors of control structure, data 
structure and execution sti*ucture complexities affected the 
program understanding, -whereas the program size did not affect 
The factors other than these four had a dominant effect on 
program understanding. However, these unexpected factors 
remain unidentified at this stage. 

From the above results, it may be concluded that the 
- difference between two levels of the factor of program size 
was not enough. Even longer programs should have been used 
for the experiments. 

It is evident from the comments about variable REQ 
and about the familiarity with formulae and algorithms 
(program 10 and 16) that problem domain knowledge base plays 
a very important role in program understanding. 

4.2 EXPERIMENT 2 : COMPLEXITY RATING 

The purpose of the present experiment was to study the 
ty of the arg-uments forwarded to explain the anomal 
ies in the results of the previous experiment. It was argued 
that some of the programs (2, 6, 10, 16) were more complex 
than they were originally thou^t to be and the source of 



88 


complexity v^as not among the four factors of discussion. It 
was decided to get the complexity of the programs rated by 
different group of programmers. The experiment also provided 
an opportunity to study the effectiveness of the subjective 
rating of program complexity. 

To minimise the subjectivity in the rating, the subjects 
were required to write the summary of the programs after under- 
standing. Subjects rated the complexity of the programs after 
writing the summary and then indicated the factors responsible 
for the complexity as well as their contribution in percentage. 
It was thought that writing the summary of the program, identi- 
fying the sources of complexity, and their percentage contribu- 
tion would minimise the subjectivity in complexity ratings of 
the programs. The details of the experiment and results are 
described in the following sections. 

A, Subjects: 

Seven senior graduate students of computer science 
program at the Indian Institute of Technology, Kanpur served 
as subjects. They had approximately two years of experience 
in FORTRAN and Assembly language programming. They also 
knew COBOL. 

B. Procedures; 

The experiment was conducted in a class room with all 
subjects together. The experiment was divided in four 
sessions of 2-hour duration. Each session has four 
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subsessions of 30-minute duration. The four experimental 
sessi on s were scheduled on different days. 

The task and purpose of the experiment were explained 
to the subjects. Instruction sheets, reproduced in -Appendix I 
were also distributed to them. In each subsession, subjects 
received a different program chosen randomly. All subjects 
worked with same program in a subsession. They were requested 
to read and understand the program. After 15 minutes, programs 
were collected back. They were asked to write a brief summary 
of the program, which they have just read, in next five minutes. 
In the last five minutes, subjects judged the complexity of 
the program. They gave their complexity rating on a 10-point 
scale. They also estimated the percentage contribution of 
various factors to the program complexity. 

A blank sheet of paper was provided to each subject 
for writing the summary. For complexity rating and percentage 
estimation, cyolostyled sheets listing the factors of program 
complexity and with blank space left for answers, were distri- 
buted. 

This procedure was repeated over’dl the subsessions. 

0. Results: 

A 7x2x2x2x2 analysis of variance was performed on data. 
The factors were subject, program size, control structure, 
data stru cture,and execution structure complexities. The 
F ratios were computed as described in the previous experiment. 
Tables 4.2 lists the F ratios for main and interaction effects. 
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E RATIOS EOR COl'CPLEXITr RATINO 


S.ITo. 

Eactors and Interactions 

E Ratio 

1 

Program size 

1 .459 

2 

Control structure 

0.069 

3 

Data structure 

11 .413* 

4 

Execution structure 

22.711* 

5 

Program size x Control structure 

12.692* 

6 

Program size x Data Structure 

11 .743* 

7 

Program size x Execution structure 

11.172* 

8 

Control structure x Data stru'^ture 

15.54* 

9 

Control structure x Execution stru^'ture 

10.159* 

10 

Data structure x Execution structure 

5.448* 

11 

Program size x control structure x 

Data structure 

1.877 

12 

Program size x Control structure x 

Execution structure 

0.264 

13 

Program size x Data structure x 

Execution structure 

2.591 

14 

Control structure x Data structure x 

Execution structure 

39.69* 

15 

Program size x Control structure x 

Data structure x Execution structure 

8.092* 


* indicates statistically significant. 
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For graphical analysis, the mean program complexities 
were plotted as functions of 2-, 3- and 4-factors. There were 
six sets of curves for all 2-factor comhinations. The curves 
are given in Figure 4.4. The curves corresponding to 3- and 
4-factor combinations are plotted in Figures 4.5, 4.6 and 
4.7 respectively. 

It is evident from the curves of Figure 4.7 that the 
program number 2 had highest complexity among all the sixteen 
programs. The ordering of programs on the basis of these 
curves in Figure 4.7 is 2, 10, 16, 13, ... The points 
corresponding these more complex programs are encircled on 
the curves along with number of the program. Except for 
the above four programs, all other programs had comparable 
complexity. 

The trend of the curves, in general, supplements the 
results of the previous experiment. A comparison of Figure 4.1 
and 4.4 reveals some interesting similarity. Sven though the 
subjects for the two experiments were drawn from the distinctly 
different population, the results have a remarkable similarity. 
Subjects of Experiment 1 had scored higher for those programs 
which had low complexity ratings from the subjects of the 
Experiment 2. The difference in the results of the two 
experiments are quantitative but not qualitative. 

The above observation and F ratios suggest two 
conclusions. First, the argument forwarded to explain 
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the results of the previous experiment seems to he valid. 

Some of the programs were more complex than they were 
intended to he. Second, the difference in two levels of 
factor of program size were not enough to affect the 
complexity. Itirther, the numerosity of control statements 
in the program did not increase the program complexity as 
was expected. The significant factors were data structure 
and execution structure complexities (see Table 4.2) 

In addition, the results of the experiment suggest 
that suhoective rating of complexity of programs may he a 
good measure of program understanding provided the subjects 
are given some initial exercise to understand the progrem. 

Encouraged by similarity in the results of two 
experiment, we decided to analyse the summary of the 
programs written by tie subjects in Experiment 2. Summary 
was graded on a 10-point scale. A 7^^2x2x2x2 analysis of 
variance was performed on the score from the summaiy. The 
E ratios for main and interaction effects are given in 

Table 4.3. 

Mean scores for programs were plotted as function of 
•ombiratiouief factors. The six sets of curves for all 
2-factor combinations are plotted in Eigure 4.8. The 
curves corresponding to 3-factor combinations are given 
in Eigures4.9 and 4.10. 
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TABLE 4.3 

F RATIOS PORPROGRAI^ SUMMARY 


S.Ro. Factors and Interactions F 

Ratio 

1 Program size 0.0357 

2 Control structure 1.964 

3 Lata structure 1 .755 

4 Execution structure 0.474 

5 Program size x Control structure 1 .48 

6 Program size x Data structure 5.04* 

7 Program size x Execution structure 2.245 

S Control structure x Data structure 5.54* 

9 Control structure x Execution structure 0,658 

10 Data structure x Execution structure 1.638 

11 Program size x Control structure x 0.46 

Data structure 

12 Program size x Control structure x 12,11* 

Execution structure 

13 Program size x Data structure x 9.33* 

Execution structure 

14 Control structure x Data structure x 0,49 

Execution structure 

15 Program size x Control structure x 3.06 

Data structure x Execution structure 


* indicates statistically significant. 
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A comparison of curves in J'igures 4.1 and 4.S suggests 
that the differences in the curves of two figures are quanti- 
tative only. There is no qualitative difference. This 
quantitative difference in the performance of the two groups 
of the subjects is to be expected. The subjects of the 
se-'ond experiment had a more formal advanced education in 
program development, methodologies, language features, etc., 
than those of Experiment 1 . They are also expected to be more 
talented than the other group of the subjects. It is not 
surprising, therefore, to find that subjects of Experiment 2 
did not find the programs as difficult as the first group. 

4.5 CONCLUSIONS PROM RESULTS OP EXPERIMENTS 

A global view on the results of the experiments 
described in Chapters 3 and 4 suggest some important 
conclusions. 

The meaningfulness of the expressions, data names 
and their organization in context of some problem domain has 
a very dominant effect on program understanding. This 
conclusion is also supported by the fa-^t that four of the 
programs used in the experiments described in this chapter, 
were more difficult to understand due to ambiguity in 
referential meaning of the variable name, U'^ familiarity 
with the problan domain and the algorithm. The selection 
of data names, their organization and the organization of the 
events in program have very important bearing on program 
understanding. 
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The execution structure complexity affects the program 
understanding. This complexity can effectively he -controlled 
hy number of assignments in the program and the number of 
operands in the assignment statement. 

The experiments have brought out the s-lgnificfRnc® of 
data structures in program complexity. Data structure was 
significant in all the experiments described in this chapter. 
However, the results of the experiment 2 of Chapter 3 
indicated the ineffectiveness of the numerosity of data 
structures. This contradiction is probably due to the 
nature of the program selected for that experiment. 

Programs B and C of that experiment were same as program 
number 2 of the experiment of this chapter. Since this 
program had highest complexity "rating, the lowest score 
for that program in the experiments of Chapter 3 be cones 
expected. So, in general, it can be concluded that the 
data structure of the program affects program understanding. 

The subjects of Experiment 2 of this chapter had 
identified data structures of the programs as the second 
major factor contributing to program complexity. They 
identified control structure complexity as a factor 
contributing most to the program complexity. 

The next chapter discusses the formal abstraction of 


data structures. 



CHAPTER 5 


ABSTRACTION OP DATA STRUCTURES 


Various methodologies have been suggested to control 
the complexity due to control structure, data structure, and 
execution structure. The concept of subroutine helped in 
reducing the functional complexity of programs. It is -well 
suited to abstract operations on data. Dijkstra, [ Dijkstra, 72], 
lind Hoare, [ Hoare, 72], advocated the use of only a few 
control and data constructs which are conceptually simple and 
easier to abstract formally, to reduce respectively the 
control and data structure complexities. The impact of 
these methodological suggestions on program understanding has 
not been investigated rigorously. Only in the case of control 
structure, some studies have been made,[Sime, 73 ] >CSheppard,78] . 

The results of the experiments, described in previous 
chapters, indicated the importance of data strutrfeures in 
program complexity. The selection of data names and organi- 
zations which are "natural" in context of some problem domain has 
jprpfotirid' U:ff'^^eto-«n program understanding. However, the data 
structure complexity ^oes not increase with increasing the 
number of vectors and tia tries in the program. The complexity 
metric for this factor should be based on the method of 
abstraction of data structures for mental representation. 
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It was our earnest desire to investigate the impacts 
of methodological suggestions, of structuring and abstracting 
data, on program understanding. But due to unavailability of 
the subjects familiar with the above concepts, the experiments 
could not be conducted. FORTRAN" has very limited data 
constructs. This, further, constrained the experimentation 
on data structures. We decided to pursue the study in formal 
aspects of the specification technique of data abstraction. 

The organisation of the chapter is as follows. The 
first section briefly describes the concept of data abstrac- 
tion and different techniques of specifying it. The second 
section contains a formal definition of completeness and 
consistency of specifications and a proof of their undeci- 
dability. The third section includes an informal comparison 
of the techniques and suggested modification in one of the 
technique. 

5.1 TYPES OP SPECIPIOATIOIT 

-- software system is collection of data and a set of 
operations on them. Its specification is a precise state- 
ment of the requirements that a software system must satisfy. . 
Specification techniques may be classified in two groups: 
operational abstraction and data abstraction. 

In operational abstraction, the semantics of the 
operations are defined by their effects on the state of 
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the data structures. The states of the data structures are 
abstracted in terms of predicates and axioms. The predicates 
•which define the domain and range of the operations are known 
as pre- and post-conditions respectively. In other words, the 
pre-conditions describe a state of data structures in which 
the operations may be started. The post-condition describes 
the state of the data structures after the operation terminate. 
Specification techniques of t hi s group are based on the 
pioneering works of Ployd, [Floyd, 67 1 ,Hoare , [Hoare, 69], 
Dijkstra, [bijkstra, 75 ], Mamm, [ Manna , 74,73], and many 
others. 

In data abstraction, the software system is viewed 
as collection of operations acting on an object or a class 
of object with constraint that the state of the object can 
be known only through invocation of the operations. Operations 
are defined in terms of relations with other operations. The 
data structures of the program are not referred in the 
description. This specification is a 'blackbox' descrip- 
tion and defines only behavioral characteristics of software 
systems. 

The starting point for most of the work on data 
abstraction has been SIMULA 67 [ Uahl , 70]. Many programming 
languages, e.g. [Dahl, 70], [ Liskov, 77]and[Wulf, 74], offer 
a medianism for binding together the operations and the 
storage structure representing a data type. But they do not 
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provide a representation-independer t means for specifying 
the semantics of the operations. Major contribution in the 
development of specification techniques of this class are 
from Parnas, [Parnas, 72, 76, 77], Guttag, [ Guttag, 75,76*78] 
7a Hd Lisirov 'stL al.,[Piskov, 75] . 

5.2 SPEOIPIOATIOM TECHNIQUES EOR ABSTEACT DATA 

A brief introduction of rfpe.cifica-tion tC-Ohniques 
advocated by Parnas, [Pamas, 72, 76, 77 ] and Guttag, 

0 Oat tag, 75 '#76,78], is given below. 

In these specification techniques, a software system 
is viewed as collection of operations acting on an object. 

The set of operations is partitioned in two disjoint sets. 

The operations contained in one of the set merely report 
on state of the object without changing it. These operations 
are named as V-functions and return values that give informa- 
tion about the data objects, [ Parnas, 12 \ The operations 
contained in another set, named as 0-fun ctions , change the 
state of the object without reporting. 

In his ear 1 i er technique, [ Parnas, 72], Parnas 
defined each operation in terms of 'parameters’ (par.), 
'possible values' (pv.), ' initial values' (iv.), 'applicabi- 
lity conditions' (ac.), and 'effects’ (eff.). 'par.' and 
'pv.' give domain and range of the operation, 'ac.' lists 
the conditions which are to be satisfied for successful 
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invocation of that operation. *eff,’ defines the relation with 
other operations, ’iv,' is meaningful only for V-functions 
and gives the initial values to he reported. This notation 
served well for examples in which the execution of an 0-function 
were immediately visible and could be described in terms of 
old and new values of V-funotions. However, in the cases 
where the effects of the call of an O-function are not 
immediately visible in terms of "^-function values until some 
other O-functions have been executed, the notation was not 
rich enough to define such 0-function. Parnas used natural 
language to define such effects, name d as ’delayed effects', 
in his earlier examples [Parnas, 72]. Pfioe introduced ’hidden 
funffition' to specify the delayed effects in his other examples 
[Price, 73] .The hidden functions are not available to the 
outside user, i.e., they can not be called. Their purpose is 
purely descriptive. The effects of some O-functions may be 
defined in terms of values of hidden functions. 

These hidden functions provide information about the 
data structure that stores the state of the object. They 
are suggestive of possible implementations. Since the 
hidden functions give' information whi da is not a requirement, 
their inclusion in the specification goes against the basic 
motivation of representation-independent specification. The 
first step towards writings specification without hidden functions 
la. findrin^ a set.^of callc.Brequence^ aaroh 'tha-t a description of 
observable effects of each sequence in the set allow one 
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to deduce the ohservahle effects of any sequenoe of calls. 
Pamas, [Parnas, 77], named such a set as a characterizing; 
set of function call sequences. However, finding this 
characterizing set for a system is not an easy task. 

Bartussek and Parnas, [ Bartussek, 78] , proposed a new 
technique. The new technique allows the specification of 
modules with delayed effects without any reference to 
internal data structures. 

In the new technique, a specification consists of two 
parts. The first part, called syntax, gives the names of all 
the operations and the type of each parameter. The notation 
to specify the domain and the range of an operation has been 
adopted from [Guttag, 75].. The second part, called semantics , 
consists of three types of assertions. 

The first set of assertions defines a characterizing 
set of function call sequences. These function call sequences 
are named as Legal traces in the new technique. The Legal 
traces are set of traces such that calling the function as 
described in the trace (starting with a module in its 
initial state) will not result in traps. 

The second set of assertions define an equivalence 
relation on traces. The equivalent traces have the same 
legality (either both are legal or both are not legal) and 
the same externally visible effects on the V-fun ctions. 

These assertions extend the class of legal traces. -A-ny 
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trace that cannot he proved legal by above two sets of 
assertions are considered illegal. 

The last set of assertions are about the values returned 
by V-functions at the end of traces. These assertions describe the 
the values returned by V” -functions for a subclass of the class 
of legal traces. The traces used in this last set of assertions 
are known as normal form traces . Using the equivalence state- 
ments, one can derive the values of "V -functions at the end of 
other traces by finding an equivalent normal form trace. 

The details of the notation for syntax and semantics 
of the specification are discussed in [Sartussek, 78]. An 
example taken from [ Bar tussek, 78 ] is given here. 

Example: A stack for integer values. 

» Syn tax : 

PUSH:<Integer>x<3ta ck>-*-< Sta ck> 

POP : <S tack > ^ <S tack > 

T0P:<Stack> -*-<Integer> 

DEPTH : <S ta ck> <I n t e ger > 

Semantics : 

A. Legality: 

(1 ) I (T)=>E(T.PUSH(a)) 

(2) £ (T.T0P) = E(T.P0P) 

B. Equivalences: 

(3) £(T)=>T.DEPTHaT 

(4) £(T) =M.PUSH(a).P0P5T 

(5) £(TiT0P)=*' T.TOPsT 



110 


C. Values: 

(6) ,Z (!)=*► V(l.PUSH(a).T0P)=a 

(7) Jor all integers a, E (T) =*>• 

V(T.PUSH(a) .DEPTH )=1+V(T.D]i5a]H) 

(S) V(DEPTH)=^0. 

In the above specification, ^ (T) ig a predicate and T is a 
trace, E (T) is true if T is a legal trace. Eurther, if T is 
legal trace, x is syntactically correct call on a V-function 
and I (T ,X) then V(T.X) describes the value delivered by X 
when called after an execution of T. »=’ is an equivalence 
relation. Two traces T^ and T 2 equivalent if for any 
subtrace, S, E (T^ .S)= E (T 2 .S) and e(T-| .S.X)=> V(T^ .S.X) 

= V(T 2 .S.X) where X is a V-function. 

The above specification is of a stack with unlimited 
capacity. The first assertion of semantic part states that 
if I (T) is true then S(T.PUSH(a)) will be also true. That 
means if 1 is a legal trace then T.PUSH(a) is also a legal 
trace. So any sequence of PUSH operation is a legal trace. 
The second assertion states that if T.TOP is a legal 

trace then 1. POP is also a legal trace. That is if TOP can 
be called after a sequence T then POP can also be called 
after T. The equivalence section allows one to reducse any 
legal trace with PUSH, POP, and TOP to one that is equivalent 
but contains only PUSH operation. The last section, values, 
defines initial value of DEPTH and values of TOP and DEPTH 
after PUSH operation. 
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The formal basis of specification technique advocated 
by Zilles, [Zilles, 75], and Guttag, [ Gut tag, 75 ], stems from 
the hetrogeneous algebra of Birkhoff and Lipson, [Birkhoff ,70 ]♦ 
In these techniques, an abstract data type is defined as 
hetrogeneous algebra T =[ TOI+I, B] , [Guttag, 75], where 
TOI ("Type of Interest") is the set of all values of the 
abstract data types being defined, I is the set of all other 
defined types and P is the set of operations such that-: 

a) The range of each f e B is contained in TOI+I. 

b) The domain of each f eB is a cross product of members 
of TOI+I and 

c) Bor each fe B, T.OI is either contained in the domain 
of f, is the range of f, or both. 

Like Parnas’ technique, the set of operations, B, is 
partitioned into two disjoint sets. V- and 0-function of 
Parnas are named as 0 - and S-fun ction respectively. The 
specification consists of tv;o parts:' Syntax and Semantics. 

The notation for the syntax is same as of the previous 
technique. Semantic part consists of relations which define 
the meaning of the operation by stating their relationship , 
to one another. The language for the semantic part is like 
a programming language. The following features of the program- 
ming language have been chosen: 

a) free variables, 

b) if - then - else expressions 
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o) Boolean expressions, 
d) recursion. 

A simple example from [ Gut tag, 76] is reproduced here to 
describe the notation for semantic part. 

Example: Type Stack [ Item ] 
begin 

1 . Be dare 

T7EWSTA7K( )-^ Stack 
PUSH(Stack, Item) Stack 
POP(StaGk) -► Stack 
TOP(Stack) •+• Item U { Undefined} 

ISUEWSTACK( Stack) -^Boolean 

2. Por all s e Stack, ie Item Let 

ISUEWSTACK (NEWSTAaK)=True 
ISNMSTA SK (PUSH ( s , i ) ) =Pal se 
POP (NMS TA IK ) zrUMSTAOK 
POP(PUSH(s,i))=s 
TOP (NEWSTA OK ) =Un d ef in ed 
TOP (PUSH(s,i))=i 

end . 

In the above specification, the equations within for all 
and end are the axioms which describe the semantics of the 


operations. 


'Undefined’ is treated as additional value. 
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5.3 OOIMIPLETEKESS km DFSISTEITGY OP SPEOIPIGATION 

A problem associated with these techniques, as 
pointed out by Parnas, [ Parnas, 77], and discussed in detail 
by Onttag, [Guttag, 75 ], is of knowing when one has completely 
specified the abstract data type. Guttag has shown for his 
technique that the problem of establishing whether or not a 
set of axioms is complete is, in general, undecidable, 

Guttag has also suggested a general out line of attack to 
-reate the axioms so that they are complete [ Guttag, 75, 76 ] . 

It is expected that the unde -idability of completeness 
and consistency of the spe -ification will be true also in the 
case of Parnas technique. In this section, a formal definition 
of completeness and consistency is given along with the proof 
of their undecidability. The notation adopted for the proof 
is based on larnas earlier (1972) notation. The reason for 
choosing this old notation instead of recent one, [Bartussek,7S] 
i s that we believe the earlier technique with suggested modi- 
fications to be superior compared with other techniques in 
some of the important aspe-ts. 

A software module is viewed as collection of operations. 
The module can be used only through invocation of these 
operations. The specification defines the module by defining 
the set of all sequence of call of operation which can be 
carried out and effect of these call sequences on state of 
module, starting with initial state of the module. In other 
words, specification defines the feasible and Infeasible 
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sequoncfs of function ’"ialls, starting from initial state of 
the module, and their effe^’ts on the state of the module. 

The state of the module or abstract object is 
characterized by the values stored in the module. These 
values ^an be known only through calls on V-functions. The 
initial values of V-functions define the initial state of 
the module. The call of a V-function merely returns a value. 

It does not change the state of the module. The call of 
0-funottons change the state of the module. This change of 
state is reflected by change in the values returned by call 
on V-functions after the invocation of 0-functions. If for 
all feasible call sequences of 0-functions starting from the 
initial state of the module, the values returned hy V-functions 
are defined and can be derived from the specification, then 
the specification is complete . Itirther, if the value derived 
is unique then the specification is consistent. In other 
■words, the completeness is ensured if states of the module 
have only defined values. 

This definition of aompleteness and consistency of the 
specification is eiuivalen t to that of Bartussek and Parnas, 
[Bartussek, 73 ]. Since the call of V-function do not change 
the state of the module, the effect of a sequence of calls 
of both 0- and V-functions on the state of module will be 
same if calls of V-^functions are removed from the sequences. 

To make it clear, an example is given below from the 

specification of stack. 

PUSH<a ) .DEPTH .PUSH(b >sPUSH<a > .PUSH (b ) 
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This above equivalence, is not in the sense of Bartussek and 

Parnas, [.Bartussek, 78], The meaning of this equivalence is that 
the state of the module, after successful calls of left- and 
right- hand sequence of operation, starting from any state of 
the module, are same. 

The trace defined in [ Bartussek, 7S] , includes both 0- 
and V-functions and so their definition of completeness looks 
slightly different. But with above definition of ’s' , it is : 
clear that both definitions are equivalent. ; 

Definition: 

A sequence of call of operations , g 2 , ..., from : 
the set of 0-functions, denoted as g-j 0 g 20 g|^. . .og^, is said | 

to defined (or feasible^ if successful invocation of g^ may ; 

j 

be followed by successful invocation of g 2 and so on, starting | 

from the initial state of the module. If for all defined i 

sequences of operations (including null) from the set of ; 

0-functions, the values to be returned by call of operations | 

from the set of V— functions are derivable from the specificatio 
and if the values derived is always defined, then the specifi- | 
cation is 'xjmple'e. Purther, if unique values can be deduced ; 
then the specification is said to be co'^sistent. | 

The notation of Parnas earlier (1972) specification : 
technique did not permit the specification of delayed effects ^ 
and hence the feasible and infeasible sequence of operation 
calls. However, this limitation does not create any difficultyj 
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in proving the undecidability of the completeness. The modifi- 
cations in the notation are discussed in the finsl section of 
this chapter. 

We now prove the undecidability of completeness of 
specif i ca tions. 

Theorem : 

There does not exist an algorithm for determining the 
comrletcn ess of an arbitrary specification of abstract data. 

The proof is given in two stages. In first stag^, it 
is proved that for a given Turing ma-'hine, there exist a 
specification such that the specification is not complete iff 
the Turing machine halts. This result is used to establish 
an equivalence between two problems, halting of Turing 
machine and the completeness of the specification technique. 

Proof : 

(a) Let us assume that there exists an algorithm for deter- 
mining the completeness of the specification. That means there 
exists a 'Turing machin©, T, to determine the completeness and 
is guranteed to halt. 

(b) The theorem is proved by showing the non-existence of 
T and for that following lemma is proved first. 

Lemma : 

Por a given Turing machine, there exists a specification 
flWvch that the specification is not-Oomplete iff the Turing 


machine halts. 



117 


Proof : 

In the proof, the notations of [Hopcroft, 69] are used, 
let the given Ihiring machine be 

^ ^ where 

K is the finite set of states, 

r is the finite set of tape symbols including the 
blank ' B' , 

6 is the next move function, a mapping from KxF to 
Kx( r - { B}) X {L,R} and is defined by the set of 
quintiplcs known as rules of 1 ^. That is 
5 = { r^ , r2, . , r^ } where 

^ ~ < 0. j > ot> j d>and eR>cx e r> 3e 

(r -{ B}), and de{ L,R}. 

q^e K is the start state, 

P C K is the set of final states. 

The specification corresponding to T^ is given by defining 
the operations from the set of V- and O-functions, named as 
V-Punc. and O-Punc. The set V-Punc. is defined as 

V-Punc. ={ v^, V2, Vj, v^} where 

v^ returns the present state of T^ as value, 

V2 simulates the tape of T^ and returns the tape symbol 
under the head of as value, 
v^ has only one defined value, ’1’ , and 
v^ gives the position of the head. 
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Each operation from V-Punc, is defined below in the notation 
of Parnas, [Pamas, 72 ], 


V~Puno, : 

^1 

V-Pun 0. 


V2 

par. 

none 

par. 


none 

pv. : 

ke K 

pv. 


s e r 

iv. 

'lo 

iv. 

0 

• 

B 

ac . * 

none 

ac. 

0 

t 

none 

eff . : 

none 

eff. 

m 

m 

none 

V— Punc. : 


V-Pun c. 

• 


par . : 

none 

par. 

* 

none 

pv. : 

{ 1 } 

pv . 

• 

• 

i e set of integers 

iv : 

1 

iv . 

• 

1 

ac. : 

none 

ac . 

• 

♦ 

none 

eff. ; 

none 

eff. 

a 

none 


The set 0-Punc. is defined as 
0-Pun c . = fi lX * 

The definition of operations from 0-Punc. is given below 


0-Pun 0* 

: rj_ e 6 



c. 

•• ^n+1 

par» 

: none 



par. 

; none 





pv . 

: none 

pv . 

: none 








iv • 

: none 

iv . 

: none 





ac. 

: v-i = 14 '^2 

= a 


ac. 

: v^ e P 

eff. 

* u 

• ’ ^2 = 

^ e 


eff . 

; y = ’Undefined’ 
3 


V4 = '''4'+'' If 

d = 

R 

or 



V4 = '^4'-! If 

d = 

I 
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The semantics of symbol ”=" and *' • t i. ^re same as in - 
[ Parnss, 72]. The 'Undefined^ is treated here as special 
value. 

The way specification has been constructed ensures 
that if T^ halts then the specification is not complete. The 
operation can be successfully invoked only when T^ has 

reached in one of the final states and once it is invoked, 

V|j of V-Punc. become undefined. ; 

(c) If there exists T, T may find whether a specification 

i 

is complete. In particular, it may find if the specification, I 
built as above, of an arbitrary Turing machine T^ is incomplete.) 
Thus it may determine (lemma) whether T^ halts. Since it is i 

known that there is no Turing machine which will determine if | 
an arbitrary Turing machine will eventually halt, so the | 

assumption (a) must be wrong. And, hence, the theorem is proved.: 

t 

The proof of the lemma gives some additional information I 
about power of the specification technique. Since there exists j 
a specification for every Turing machine, it implies that , 

technique is powerful enough to specify auy computable : 

function. Selection of suitable V-Punctions has avoided the : 
'delayed effects' in the specification. i 

5.4 MODIPICATIQi'T AWD OOMPARISON OP PARUAS’ TEOHITIQIJE WITH | 

OTHER TECHNIQUES | 

The three techniques considered for comparisons are I 

[Parnas, 72], [Guttag, 75 ], snd [Bartussek, 7S] , and they | 
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ar6 raferred as techni^iue number 1, 2, and 3 respectively 
in this section. 

All the three techniques are powerful enough to 
specify any computable function, ^^he completeness and con- 
sistency of the specifications are undecidable in all the 
three techniques. 'Ihe power, completeness and consistency 
problem of the technique 2 is discussed in [ Guttag, 75], 

The major difference between the techniques are due 
to different schema used for defining the semantics. The 
n"'tation of technique 1 does not permit the specification of 
'delayed effects’. The notation of this technique is not 
rich enough to characterize all feasible and infeasible call 
sequences. Hidden functions have to be used for specifying 
the delayed effects. We suggest some modifications in this 
technique to enhance its notation. 

5.4.1 Modifications: 

The first modification is suggested to facilitate 
the specification of delayed effects. The notation of 
technique 1 does not permit the specifications of infeasible 
sequence of operation and of any special property associated 
with the sequence of operations. 

As pointed earlier, only the sequence of call of 
operation from the set of 0-functions are important because 
they only change the state of the module. Some of the 
properties of the sequence of call of 0-function may not 
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be defined with the definition of individual operation of the 
sequence. Such properties may be specified in terms of values 
of V-function before and after the invocation of the sequence. 
Ihe suggested modification in the notation permit the speci- 
fication of such sequences. 

The notation is given below. 

< Relations >:: = [< Sequence of operation calls from 0-functions 

< effects >] * 

’Relations' signifies the special property associated with 
a sequence. It is defined by naming the sequence and speci- 
f./ing its effect in terms of values of V_functions, The 
values returned by affected V-function may have special value 
'ERROR' to indicate that this sequence is not permissible. 

The execution of such sequence of operation would result in 
error. An example of the notation below is given by defining 
the delayed effect of POP operation on TOP of stack 
[Parnas, 72]. 

Relation : PUSH(a).POP 
eff. : T0P=’T0P’ 

DSPTH=' DEPTH' 

The above specification states that if PUSH(a) operation is 
followed by POP, the value returned by TOP and DEPTH after 
the execution of this sequence will remain same as before 
the execution of the sequence. The fourth predicate of the 
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first example of the first section defines this property. 
Similarly, boundedness of the stack oan be defined under 
this notation. 

The se'‘ond suggested modification is about specification 
of exception handling strategy. In technique 1, the operations 
can be called if and only if the condition listed in ’ac.' are 
satisfied. The module will trap in states where conditions 
are not satisfied. There is no record of previous invocation 
of operation if error has occurred. It is the responsibility of 
the invoker or user of the module to respond to the condition 
of error. The imposition of this ccstraint may be helpful in 
reducing some burden of the implementor from the interfacie 
problems. But it will be preferable to have a feature in 
the specification te-^hnique by which one can define any , 

general exception handling strategy. This aspect has been | 

ignored in both technique 2 and 3 also. | 

I 

In modified notation, 'ac.’ will contain not only the | 
Conditions to be met for successful invocation of the operation ■: 
but also have information about the action to be taken if the , 
Conditions in 'ac.' are not satisfied. The action may include 
passing the information about the source of error, a default 

action or a resume option to the invoker. The proposed i 

! 

notation is given below. ; 
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<ac>::= f^londitiori> |<ac>[< Cod6=iri't6gei> ] * 
j < ac > [< Signal=Boolean Values >]* 
j < ac > [< Del“ault=ac‘tion name>]* ]* 

[] * indicates repitition. 

If ’ac.’ contains only conditions then the operation 
is terminated if these conditions are not satisfied. TTo 
history of call is -kept in the system and no action is taken 
for the exception. If in ’ac.’ conditions are followed hy 
’codes’ , the operation is terminated and an integer code, 
identified with the condition is passed to the invoker. So 
in this case user knows the source of error. In the case of 
signal, the operation will he either terminated or resumed 
under the direction of the user. If some default -action is 
to be taken that also may be listed. The above notation is 
based on the work of G-oodenough, [Goodenough, 75]. 

We expect that the inclusion of these modification 
will increase the range of applicability of the technique. 

5.4.2 Oomparison: 

In technique 2, the semantics is defined in the form 
of set of identities of axioms. Both V- and 0-functions are 
treated exactly in the same way for describing their definition 
This is in contrast with technique 1, where only effected 
operations of V-functions are included in the ’eff.’ section. 
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The notation for semantics, in technique 2, does not limit 
the depth of nesting on "both side of identities. However, 
for constructing a specification which is complete, the 
depth of nesting on left-hand side of identities is limited 
to two only [ Gut tag, 75]- This limitation on depth of 
nesting has necessitated the use of hidden functions ,G-uttag, 
[Guttag, 76' ], have to use hidden function DEPTH. Purther, 
limiting the depth of nesting on left-hand side of identities 
may increase the size of the axiom. 

The notation for the semantic part in technique 3 is 
better in many respect as compared to technique 2. Since the 
technique 5 provides for the definition of all legal sequences 
of function calls, the need for hidden functions are removed. 
There is no limitation, from the notation, on the length of 
the sequence of function calls, the characterization of legal 
and Illegal sequences. 

It is easier to prove the correctness of the imple- 
mentation of data abstracts in technique 2 than in other 
techniques. The duality between axioms and programs has 
been very fruitfully exploited to prove the implementation 
[ Gut tag, 78] . The language used for representing programs 

and axioms are same. IXie to this commonality of language, 
one can view axioms as programs and vice-versa. This duality 
also gives an implementation-direct implementation. This 
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iin pi Bin 6n tQ "ti OTi is in tsxms of "tzs© stiruG'tujrBS gBtiGpa.'tBd by 
V-funotions. The direct implementation can also serve as an 
aid in constructing specifications. 

In other two techniques, to prove the correctness of 
implementation, language other than specification language 
has been used. The operations and their implementations i.e, 
pi^egrams are defined as predicate transformer. The equivalence 
of two predicate transformer is proved with the help of homo- 
morphic mapping from abstract data to implementation 
[ Bartussek, 76], The proof becomes difficult due to use of 
two different languages for specification and implementation . 

The technique 1 has some special features which have 
been ignored in other techniques. The initial values of 
V-functions are defined explicitly. In other two techniques, 
they are defined through axioms and assertions. In our 
opinion, explicit definition is better than the implicit. 

The 0-functions are defined by giving their effect on 
set of V.-functions. The ' eff/ section includes only affected 
V-functions. In situations, where each 0-function affects 
only a few V-fun ctions , this notation will result in small 
size of the specification, ^-'^ith the proposed modification, 
the technique 1 may prove better from readability and under- 
standibility point of view. However, the technique 2 is 
better in regard of proving correctness of implementation. 
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OOlfCLUSIOKS MD SUGGESTIONS 


6.1 CO.^OLUSIONS 

V/e have demonstrated that four algorithm-dependent 
features: size, control structure, data structure, and 
computation structure of the program contribute independently 
to perceived program complexity. The above four factors 
of program complexity affect program understanding. Simple 
syntactic measures of the factors, such as counting number 
of IE statements [Gilb, 77], vectors and matrices, do. Dot 
reflect their contribution in psychological complexity of 
programs. In our opinion, the other measures of control 
complexity are also rather superficial. All these measures 
reflect tlju 3 of certain features of control 

structures such as number of basic paths [Mdabe, 76], 
number of knots [Woodward, 79], etc. They do not take into 
ac'ount regularity or structure in different features of a 
factor. Since human beings are very efficient in identifying 
regularity or structure at various levels, any regularity 
in the features of a factor aid. understanding. The measure 
of Complexity of a factor should take into ac'’ount both 
nuraorosity and regularity of different features of the 


factor. 
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Program features "which establish an association or 
suggest an cCnalogy "with a problem domain, termed collectively 
as factor of ’ meaningfulness' , have an important bearing on 
program understanding. It is evident from the experiments 
that uncontrolled factors in the program contributed more 
to program complexity than the controlled ones. We believe 
that subjective rating may prove to be a good measure of 
program-complexity provided a serious exercise with programs 
such as writing a program description or program modification, 
etc., is carried out to aid understanding. The exercise will 
make the rating more dependable. 

We have shown that Parnas* early specification 
technique (1972) is powerful enough to specify any computable 
function; further, the completeness and consistency of a 
specification are undecidable. We have suggested some 
modifications in the above technique to specify exception 
handling strategy and ’delayed effects' ,[ Parnas, 72], We 
believe that the modified technique has some good features 
which "Would make it more useful than other techniques 
( [Guttag, 75 ], [Partussek, 78 1). 

6.2 LIMIMIOWS 

Programs used in the experiments were selected from 
various programming text-books and were modified to meet the 
criteria of selection. The controlled variation of factors 
of interest and elimination of uncontrolled effects as far 
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3 ,s possiH6 iii&d.s ijh. 6 SGai^ch of pgciI life progDrs-ms for the 
experiments very diffi:;ult. This led us to use text-book 
programs for the experiment. Time- constraint of the subjects 
did not permit the selection of large programs. The above two 
factors may cast some doubt on generalizability of these 
results to real life programs. However, wo believe that the 
above factors do not limit the applicability of these results 
in real life situations. Most of programming methodologies 
eraphr'isize dividing the problem in smaller parts to control 
the complexity- At program level, these small parts are 
subroutines, procedures or modules. The typical size of 
subroutines or modules may range between 20 to 100 lines. 

The overall complexity of the program will depend on the 
complexity of constituent parts or modules* and their inter- 
actions. We envisage the application of the results of 
thf thesis at module levels. For interaction among modules 
and their contribution to overall program complexity, further 
experimentations are required. 

F0RTRA.H, the language used in the experiments, has 
limited data structuring constructs such as vectors and 
matrices. It does not have constructs for records, pointers 
and other interesting data structures. This limitation 
prohibited us from experimenting with various important 
aspects of data structuring end their effect on program 
"’omprehen sion . 
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In some of the experiments, novice programmers served 
as suh;}e(?ts. We did not consider experience as a factor in 
evaluction of expression. The task of expression evaluation 
involves following a mechanical procedure and the effort need- 
ed for this task is expected to show a marginal change with 
the experience Similarly, for Experiment 2 of Chapter 3, 
we expected only quantitative difference in results with 
experienced programmers. In our opinion, the effect of 
experience depends on the nature of task. The performance 
of novi -'Q and experienced programmers did not show any diffe- 
rs nco .i.n ca.. Ox' the ■.^.•r-'^rim'^r't conducted hj'’ [SlinPid ermnn 

6.3 03S571VATIOWS 

Some interesting aspects of program understanding 
and of eKperimontation in them were revealed during informal 
talk with subjects after experiment, from oomiients written 
during experiment and by observing the subjects during the 
period they read and understood the program through their 
rough work. 

In Experiment 3 of Chapter 3, the subjects were required 
to eval'uate the values of variables at different points of the 
program. It was expected that this exercise would aid in 
understanding. Twelve subjects wrote that they could not 
reconstruct the complete program because they studied only 
that se-tion or path of the program which was to be executed 
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for evaluation of values of variables asked for. In case of 
some other sub;jec?ts, it ms evident from their reconstructed 
programs that they also understand only a section of programs 
which was necessary for evaluation of values of variables. 

These comments and observation suggest that evaluation 
of values did aid in understanding. However, the values of 
variables could be evaluated even by partial understand.ing 
of the program. If the evaluation of values of variables at 
diffeient points of the program is to be used as a measure 
of program understanding, then the care should be taken in 
selection of variables and points in the program for their 
evaluation. These variables and points of evaluation of 
their values should account for all possible paths of the 
program. Various algorithms reported in the literature for 
test data selection may be applied to decide about variables 
and the points at which their values ar» to be evaluated. 
Without such precaution, the performance of the subjects in 
evaluation of the values may not reflect their understanding 
of program. 

In the experiments (Experiments 1 and 2 of Chapter 4), 
subjects used the blank portion of the' program sheet as 
scratch-pad for some calculation, drawing flowcharts, 
comments, etc. These rough works were indicative of their 
activities during program reading a^nd understanding and 
revealed partially the different methods of abstracting 
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information about the program. One of the subjects always 
drew the complete flowchart of the program he was studying. 
ITwo other subjects used to identify the section of the 
program with some problem domain dependent events by tracing 
the control blocks of the program. For example, the subjects 
had identified a blo"'k by drawing ali'-'e from start of DO 
loop to its end and wrote 'initialization of variables' near 
the line. The complete program was divided in many su-jh 
blocks. One subject always started making tables correspon- 
ding to data st.ru Hu re of the program. Table had as many 
entries as the number of simple variables read through 
I/O statement. Entries consisted of variable name and 
their format. Similarly, for vectors and matrices he would 
draw a separate line structure. He identified various 
events of the program with '•hanges in the above structure. 

The scratch-pad work mostly contained the line structures, 
variables names and their values at different stages of 
the program and comments. Such preliminary exercises of 
the rest of the subjects did not exhibit any pattern. Their 
scratch-pad contained only f^w calculations, some variable 
names, etc. 

The experiments were not designed for investigation 
of various methods of abstraction in program understanding. 
The scratch-pads of the subjects were also not very infor- 
mative. It will be premature to conclude about methods of 
abstraction from above observations. 
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In Experiment 1 of Chapter 4, subjects indicated in 
their comments and through informal talk after experiment 
that some of the programs were more difficult. The source 
of increased complexity, as identified hy the subject, were 
not due to controlled factors of the experiment. In one 
program {^ q , 2 ) the difficulty was created hy one variable 
(REQ). Many of the subjects could not make out the purpose 
of the variable and what it represented. In another program 
(No, 6), the complexity was due to switching of a variable, 

J¥. Similarly, in some other programs (No. 10, 16) the complo 
r.iti-op. v;oro due to unfamiliarity of the subjects with 
algorithm and formulae used in the programs. These various 
uncontrolled factors had a very significant effect on program 
understanding. 

The above observation is a pointer to the need of 
elimination of uncontrolled factors as far as possible for 
the success of experiments. 

6.4 DIREOTION EOR FURTHER WORK 

The observations of the previous section are suggestive 
of the direction of the future work, 4.s we have pointed out 
earlier, the various metrics of complexity of program factors 
do not reflect their psychological complexity. There are no 
measures for complexity of data structure and computation 
structure of programs. Efforts should be directed towards 
defining meaeures for various factors of program complexity 
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based on validated theory of program comprehension. Experi- 
ments should he designed to study various methods of abstrac- 
tion employed by human beings in program understanding. 
Eormatio*! of mental representation of programs, strategies 
adopted for learning the program and the impact of various 
metnodologioal suggestions on these processes shoxild "be 
thoroughly investigated. 

Various measures of program comprehension such as 
hand simulation of programs, responding to questionnaires 
about the program, memorization of the program, etc. have 
been used in experiments reported in the literature. Among 
all such measures, program memorization has been used most 
videly and accepted as a good measure. However, no comparative 
evaluation of different measures has been done. The superiority 
of a measure should be established through experiments. 

Results of the experiments in program comprehension 
are of qualitative nature. Qualitative results do help in 
identifying the problem but they do not provide any quantita- 
tive basis for enginineering design and standardisation. Eor 
example, the quan tifi cation of optimal size of subroutine, 
the depth of nesting of IX) loops, number of Boolean variables 
in a conditional statement, etc., from program understanding 
point of view, will give a sound basis for good engineering 
designs. 
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I'inally , ©ffort should he directed towards standardising 
the development of programs for the experimentation to eliminate 
uncontrolled factors as far as possible. Various syntactic 
transformations may he employed to get different versions of 
programs for experiments. 
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APPMDIX A 


STIMULUS GOMBIMIONS POR PRA.GTIOE 


ML ANCHORS 

Program size 
Control structure 
Data structure 
Exeaitional complex. 


PRAl 

= Very small 
= Extremely simple 
= Extremely simple 
= low 


PEA2 


Program size 
Control structure 
Data structure 
Executional complex. 
PRA.GTIGE EXiiMPLES 


= Very tig 
= Extremely complex 
= Extremely compl ex 
= Very higli 


PEA3 


Program size 

= Small 

Go-otrol structure 

= Simple 

Data structure 

= Simple 

Executional complex. 

= low 


PRA4 


Program size 
Control structure 
Data structure 
Executional complex. 


= Small 
= Simple 
= Very complex 
s Moderate 
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Program size 
Control structure 
Bit a structure 
Exeoutionil complex. 

Program size 
Control structure 
Bata structure 
Execut tonal complex. 

Program size 
Control structure 
Bata structure 
Executional complex. 

Program size 
Control structure 
Bata structure 
Exe cu tional compl ex. 

Program size 
Control structure 
Bata structure 
Executional complex. 


PRA5 

= Small 

= Moderately complex 
= Very complex 
= Low 

PIIA.6 

= Moderate 

= Moderately complex 
= Moderately complex 
= low 

PM7 

= Moderate 

= Moderately complex 
= Moderately complex 
= Moderate 

PM8 

=> Big 
= Simpl e 

= Moderately complex 
= Low 

PIIA9 

= Big 

= Moderately complex 
= Moderately complex 
= High 



Program size 
Control structure 
Bata structure 
Exeoutional complex. 


= Big 

= Very complex 
= Very complex 
= High 



appendix b 


judgemental experiment 


INSTRU mON 

We are engaged in developing a measure for the 
oomplexity’ of programs. This measure can Decoupled ■with a 
suitable sof'bw.are reliability models for predicting such 
things as the number of errors remaining in the software 
package, the expected time to next error, etc. In this 
experiment, you have to give your judgement about the 
program complexity on the basis of various information 
given to'ycu about the program. 

The program caaplexity may be characterized by a 
combination of four different dimensions of •the program. 

The four dimensions are: 

(i) Program size complexity 

(ii) Control structure complexity 

(iii) Data structure complexity, and 

(iv) Execution structure complexity 

These complexities can vary continuously. But for 
convenience, 'we have divided each complexity in three levels 
The program size may be specified in terms of number of 
different symbols and their frequency of occurrences in the 
program. The symbols are variable names, operator symbols, 
punctuation marks, different language constructs, etc. The 
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program size is divided in levels, small , moderate , and big . 
One may assume 10 lines program as small and 500 lines program 
as big 

The oontrol structure oomplexity is due to logical 
decisions in the program. Each logical decision introduces 
a separate possible p:ath of computation. The control 
structure complexity may be defined in terms of number of 
possible paths in the program and the number of elonentary ' 
Boolean values used to identify each path. Three levels for 
this complexity are simple , moderately complex , and very 
complex * A straight line program or basic block may be an 
example of s imple . The programs having large number of 
control constructs and associated Boolean values can be 
consid-red as very complex . 

The data structure characterizes the relation between 
the data elements. Some example of data structure are array, 
stack, queue, files, records, etc. The complexity of data 
structure may be characterized in teimis of variable names 
needed to define the data structure. In the case of array 
two variable names are needed to form this data structure. 

The three levels of this complexity are simple , mod era tely 
complex , and very complex . One may consider a EORTRAK" 
program operating on only single variables as having a 
data structure complexity zero or extremely simple. On 
the other hand, a COBOL program with wide variety of files 
and records is an example of very complex data structure. 
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The execution structure complexity emphasizes the 
severity of each assignment in the program. The executional 
complexity may he defined in terms of number of operands 
needed for the execution of the statement and the number of 
assignments made to variables. The three levels of this 
complexity are low , moderate . and high . 

You consider the information available about all the 
four dimensions of the complexities of a program and then 
judge how complex is the program along a 31-point scale. 

The scale is made up of 31 equally spaced holes on a 
wooden frame. All holes are identical. The leftmost hole 
on the scale represents very low program complexity and right- 
most hole on the scale represents extremely high program 
complexity. Read information about one program, judge its 
complexity and insert the wooden peg in the hole which 
represents the complexity of the program considered. 

You will be judging 137 different programs. In some 
cases you will know only two dimensions of the program ; in 
other cases you will have information about all the four 
dimensions. You try to base your judgement of the program 
complexity on the supplied information only. That is, 
when the information about two dimensions are available, 
use that information alone; when information about all the 
four dimensions are supplied, use all of than in making 
your judgements. 
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To make you familiar with the nature of the task, use of 
response scale and your role as oudge, we shall give you 10 
practice examples. You work with all these examples one hy 
one. During the practice period, feel free to ask questions 
if you have any. After the practice period, take one card 
at a time, read the typed information pertaining to the 
program and then judge its complexity, finish judgement of 
all the 137 cards in this manner. 

KEHIMBER that you are judging PRO&EAM OOMPIEXITY 


Thanking you. 



APPMDIX G 


EXPRESSION EVALUATION 

IITOIAN INSTITUTE OF TEOHNOLOGY, KiVNPUR 
COMPUTER CENTRE 

TA-306 Lecture Quiz 3 FeL. 1979 

Art of Computation 

Given the values of variables: 


A = 16.0 N1 = 5 

CENTGR =40.0 N2 = 8 

H = 5 .0 N3 = 2 

R = 10.0 N4 = 10 

U = 20.0 MN = 6 

T =5.0 N5 = 4 


Find the values of variables calculated by assignments below 
in thr fornL of simplified fraction. Numbers produced by 
cal -viators are not acceptable. Mark any error found and 
calculate after correcting. 


(i) 

S=U*T+0.5*A*T*T 


S= 

(ii) 

FARNHT=32 . 0+CENTGR*9 . 0/5 . 0 


FARNHT= 

(iii) 

MEAN=(N1 +N2+N3+N4 )/N5 


MEAN= 

(iv) 

V0LUI4=3 . 1 42*R*N*H/3 1 0 


V0LUM= 
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(v) 

AVERAGE (N1 +N2+N5+N4 )/N5 

AVERAG= 

(vi) 

XQ=S*a+T*U/(R+H) 

SQ= 

(vii) 

MXS=(N1+N2)*(N3+N4)/N5 

MXS= 

(vilj) 

Z?0=X*'r+5*T*Mlf 

ZPC= 

(ix) 

TSX=100.0*A+R+5.0 

TSX= 

(x) 

PT}S= H+R+T*U/10.0 

PT’S= 



iiPi-ENDIX D 


EiiTii STRUCTURE COiiPLEXITY 


li^DL'U'f INSTITUTE OE TECUNOIOGY, KMPUR 
COMPUTER CENTRE 

TA-306 Lecture Quiz Y April 1979 

Art of Computation 


INSTRUCTIONS 

This quiz is mf^'Oat to test your ability to understand 
EORPR;iN programs. The oompleto quiz is of one hour duration. 

In first part, read carefully the program given to 
you and try to understand it. You have 25 minutes to do it. 
Make sure you write your name and roll number on the program 
sheet, because ea 'h one is being given a different program 


to understand. 

After 25 minutes, we will collect the program sheets 
and distribute answer books. Write a description of the 
function (what the progrim does) and algorithm (how It does 
the work). This task his heen allotted 10 minutes. If you 
did not understand some point about the program or fma an 
error in it please mention that in your description. Question 


obout errors will no*'; be entertained. 

,T +Vi e nf thp 10- minutes allotted to 

When we announce the ena oi - rv- 

d a.r +V, T +1^fnrk sud wTlte thc whole program 

description writing, stop that worn uinu 

V • f. T+ ia not necessary to get trivial details 
as you rerae?raber It is noe i 

T-n omcia < 3 tatpment numbers, formats, etc. 
su -h as exact variable names, statem 

. 4 . -well vou remember the working 

We are only interested in how wen yu 


of the program. 



ilPPMDIX E 

PROGRAMS EOR EAOA STRinPURE COMPLEXITY EXPERIMETifT 


PROGRAM A 


C NUMERICAL COMI’UTA'T’IONS 

M=0 

5 REii-D 10,AB,AX,CG,YP,LZ, CQ,MR,ZS,NT,BS 

10 P0RMAT(/,P3.2,I1,P4,2,I1 ,E4,2,I2,P4.2) 

IP(RT.EQ.O) GO TO 50 
DX=ji.fl+ eg 
EH=DX4- yt 
PRINT 15,EN 
15 I<'0RiLi.T(lX,E8.2) 

IF(MR.GT.I) GO TO 25 
GZ=:AX+BS 
H0=GZ*AX+10.0 
PRIifP 20, HO 
20 POR!yLVT(lX,E10.2) 

GO TO 40 

25 IP(MR.GT.2) GO TO 35 
BL=CG+ZS 
CQ=CQ+BL 
PRINT 30, CQ 
30 P0RMAT(1X,E3.2) 

GO ‘^0 40 

35 IF(MR.GT.3) GO TO 50 
BN=LZ*MR 
CL=CQ+BN 
PRINT 30, CL 
40 NM=NM+1 
GO TO 5 
50 STOP 
ENL 
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PROGRi'J^ B 

C 02 l/i-LOULii-TIOu OP PERCEi'lTii.GE OP SliiTS PIILSD UP 
MEGER DT,OS,DO,GO,SP,RQ,i'lX 
RS.D 5,£P 
5 FORMAT (a1) 

RM 1O,DT,0E,ST,RQ,MX 
10 P0R[M(A4,I2,I2,2I3) 

15 IT=1 
D0=DT 
00= OE 
NS =1 
20 NT=1 

S0=PL0AT ( RQ ) /PIOAT (MX ) 
miD 10,jDT,UE,ST,RQ,MX 
IP(OO.NE.GE) GO TO 25 
i'IT=I'lT+1 
IT=IT+1 

SC=SC+PLOAT(RQ)/FK)AT{MX) 

GO TO 20 

25 IP(DO.NE.ET) GO TO 35 
NS=NS+1 

aG=1 00 .0*SC/PL0AT (NT) 

PRIiTO 30, PO, CO, AG 
30 P0RmT(lX,A4,2X,l2,3X,F8.2) 

CO=OE 
GO TO 20 

35 SG=PEiPiT(lT)/PLOAT(NS) 

PRINT 40,]X),SG 
40 P0RMAT(1X,A4,2X,PB.2) 

if(pt.eq.kp) stop 

GO TO 15 

STOP 

END 
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PROGR/\M C 

C 02 OAiOULATim OP PERCaSiTTAGE OP SEATS PILLED UP 

INTEGER DEPT,GOUiSB,DPTOLD, OOROLD,SE^TON,ENROLD, MAXIM 
DATA KSTOP/IH*/ 

REilD 1 0, DEPT, COURSE, SESTOH , ENROLL, MAXIM 
10 PORMAT(a4,I2,I2,2I5) 

15 IS1CT=1 

DPTOLD=DEPT 
COROLD=COURSE 
NC0RS=1 
20 NSECT=1 

SETP ITsPLOAT ( EN’ROLD)/PLOAT (MAXIM ) 

RILiD 1 0 , DEPT , COURSE, SECTOR , ENROLL ,MAXIM 
IE(COROLD,NE- COURSE) GO TO 25 
RSECT=RSECT+1 
ISE0T=ISECT+1 

SETPCT«SETPCT+PLOAT(ERROLL )/PLOAT{MAXIM ) 

GO TO 20 

25 IP(DPTOLD.NE.LEPT) GO TO 35 
RCORS=NOORS+1 

A VERilG=1 00 . 0*SETP CT/PLOAT (NSEOT ) 

PRINT 30,DPT0LL, COROLD,AVERAG 

50 P0RiMiiT(1X,A4,2X,l2,3X,P3.2) 

OOROLL=OOURSE 

55 SAVERG=PLOii.L(ISECT)/PLOAT(RCORS) 

PRINT 40,DPT0LL,SAVERG 
40 P0H4AT(1X,a4,2X,P8.2) 

IP(DEPT.EQ*XSTOP) STOP 

GO TO 15 

STOP 

END 
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PROGRkM D 

C 04 MALYSIS OP STUDMT BIO DATA 

IHTEGER R0,AE,':;T,SE,I)E,SX,MD,SG,SI)(2) ,0B(2),MI(2),PY(20),! 

SM(3) : 

miA N1^,SG,SD,0D,ML,P7,SM/51*0/ 

10 RPLvI) 15,R0,DE,AE,SX,0P,SE,MD 

15 P0mi\T(l3, II ,12,211 ,12,11) 

IP(RO.BQ.O) GO TO 50 
Ip(DE.EQ.0.0R.PE.Gr.5) GO TO 50 

IP(SX.EQ.0,0R.SX.GT.2) GO TO 50 ; 

IP (RO.GT. 500)00 TO 50 

I 

SG=SG+ji.E 

SI)(SX)=SI><SX)+1 

CD(OT)=OD(OT)+1 I 

HL(MD)=ML(MD)+1 

PV(SE)=PV(SE)+1 I 

SM(DE)=SM(I)E)+1 

GO TO 10 I 

30 AG=FLOAT(SG)/PLOAT{m) I 

AM=1OO.0*PIOAT(SP(l))/PI;OAT(MM) I 

AP=1 00.0*PLOiiT (SP(2) )/PIOAT(jM) | 

PRINT 35,NM,AM,AP,AG | 

35 P0IMAT(1X,I4,3(2X,P8.2)) | 

PRINT 40,(OI)(I),MI(I),I=1,2) 

40 POR:4iLT(1X,2(I3/),2(l3/)) I 

PRIilT 45,(PV(I),I=1»20) ,(SM(I).,I=4,5) | 

45 FORMATdX, 20(15/), 3(13/)) | 

50 STOP I 

EKE 
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program e 


C 04 AilA LYSIS OP STUDENT BIO-DATA 

INTEGER ROLNUM , AGE, CAST , STATE, DEGRE, SEX, MARIED, SUMAGE 
1 SEX COD ( 2 ) , OSTOOD( 2 ) ,MiLRITL ( 2 ) ,PR07N 20 ) , S CHEiyr( 3) 

D-'.Tji NUMBER, SUMiLGE, SEX COD, CSTG0D,MARITL,PR0VN0,SCHEM/31*0/ 
10 HEiID 15 ,ROLi'{UM,DEGRE,AGE, SEX, OAST,STATE, MARIED 
15 PORM/xT (13, II ,12,211 ,12,11 ) 

IP(ROLNUI4.EQ.O) GO TO 30 

IP(DEGRE.EQ.0.0R.DEGRE.GT.3) GO TO 50 

IP(SEX.EQ.0.0R.SEX.GT.2) GO TO 50 

IP(R0LNUM.GT.500) GO TO 50 

Nl]MBER=NUM8ER+1 

3TJvLiGE=SUI'liiGE+AGE 

SEX COD ( S EX ) =S EX COD( SEX ) +1 

CST00D(CAST)=CSTC0D(CAST)+1 

WiRITL (MARIED )=M/IRITI (ILIRIED )+1 

PROViJC( STATE )=PROVNC (STATE)+1 

GO TO 10 

50 AVGAGE=PLOAT ( SUMjIGE )/ FLOAT (NUMBER) 

A VGMilL=1 00 .0*PL0iiT(SEXC0D( 1 ))/EL0AT(NUMBER) 

AVGML=1 00 ,0*FL0jiT( SEXCOD( 2 ) )/ELOAT (NUl'IBER ) 

PRINT 35 , NUMBER, aVGMaL,AVGML,AVGaGE 
35 P0RI4AT(1X,I4,3(2X,P8.2)) 

PRINT 40,(CSTaOD(I),MRITl(l),I=1 ,2) 

40 P0RI4AT(1X, 2(15/), 2(13/) ) 

PRINT 45,(PR0YNC(I),I=1 ,20), (SCHEM(I),I=1 ,5) 

45 P0RMAT(1X, 20(13/) ,5(130)) 

50 STOP 
END 
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CONTROL STRUCTURE MB EXECUTION STRUCTURE 

QUIZ TA 306 

roll i'JO. NAME; 

The quiz is divided in PART A and PART B. You are given 
PxiRT A first and you have to answer this part in 15 minutes, 
jiftcr the completion of PART A you will he given PART B to 
answer in 15 minutes, 

PART A 


Time: 15 Minutes 
Maries: 5 

Read carefully the program given helow and try to 
understand it. The questions in PART B are also related to 
this program. The input data for the program is given at th e 
end of the program. (This data is for PART A only). You are 
required to give the value nf variable, or print-out (if it is a; 
WRITxx statement) at diffc-'ent lines of the program. The space ; 
and the format for the answer is also given in the sheet* I 

(NOTE: The programs of the Quiz are on the following pages). ; 



PROGRilM 11 


PROGRiiM 

: Calculation of gross-pay. 

01 


WRITE(6,15) 

02 

15 

POm T ( 1 X , *ID-NU!4BPR GR0SS -PAY* ) 

03 


TGP=0.0 

04 

20 

REaD(5,21) id, REGHRS, 0YHRS, Ri\TE 

05 

21 

P0RMAT(I5,3P5-0) 

06 


IP(ID.EQ.O) GO TO 2S 

07 


GRSPAY=R/lTE* ( REGFIRS+1 .5*0VHRS ) 

08 


WRITE ( 6 , 25) ID, GRSPAY 

09 

25 

P0RNL\T(1X,I7,P15.2) 

10 


TGr=TGP-^RSPAY 

11 


GO TO 20 

12 

2S 

WRITE(6,30) TGP 

13 

30 

PORi4iiT(3X, *TOTaL*, P15.2) 

14 


STOP 

15 


END 


Input data 
)5l750>5)iS4a.)i^>55.W7. 


LINE NO 

05 
01 

06 
07 
OS 
10 


Vj.RIaBLE/OUH^UT 

TOP 

OUTPUT 

ID 

GRSPAI 

OUTPUT 

TGP 


YALUE/PRINT-OUT 
0.0 (Example) 
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PR0GRMJ2 


PROGR/tM: Computation of grade-point. 

01 ¥Ria!B(6,1) 

02 1 KiRmT(1X,*R!Z5LlW)^)!yifEIGHTTziT6TOSPI]jJ)5)^]^^ 

03 10 RE/iD(5,11 ) JRj6LL,0I.lPI,aUMlArT 

04 11 .P0RMiiT(l5,2Pl0.1 ) 

05 IF(JR0LL.EQ.O) STOP 

06 REitD(5,15) WT1 ,GR1 ,WT2,GR2,WT3,GR3,¥T4,GR4,WT5,GR5 

07 15 P0M*..T(10P5.1 ) 

08 SMWT=WT1+WT2+VJT3+WT4+WT5 

09 SPI=(¥T1 *GR1 +¥T2*GR2+¥T5*GR3+^W4*GR4+WT5^R5 )/SIMWT 

1 0 CPI=(0LCP I*CUiyi¥T+SPI*SEMWT )/ ( OUMWT-eSEMIfT ) 

11 WRITE(6,16)JR0LL,SMWT,SPI,CPI 

12 16 POimT<1Xl,I5,3F10.1) 

13 GO TO 10 

14 end 


Input data 

6O43200)5)6>5)50S. O)!i0W04S . 0 

012, 0008 .0012. 0006 . 001 2. 00 1 0 . 00 1 2 .0008.00 1 2 .0008 . 0 


LIRE NO 

VARIABLB/OUTPUT 

YAIUE/PRINTOUT 

03 

OUIWT 

48. (Example 

03 

JR0LL 


06 

¥T4 


08 

SEM¥T 


09 

SPI 


10 

OPI 
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-pROQRm 13 

PROGR^iM: Calculation of compound interest. 

0 1 R3LID( 5,10 ) AMFT , IYEilR 

02 10 P0RIi-iT(Fl0.0,I2) 

03 IP(aMNT.GT.10000.)G0 to 15 

04 R.ITE=5.0 

05 IMi4TH=1 2 

06 GO TO 20 

07 15 RilTE=8.0 

OS MTH=6 

09 20 N=IYEiiR*12/IMNTH 

10 30 IF(il.EQ.O) GO TO 40 

1 1 jlOI JM= ( RATE/ 1 00 . )*AMNT>t TlWOH 

12 hMi'IT^AMNT+AOUM 

1 3 N=H-1 

14 GO TO 30 

15 40 TOITE(6,45) AMNT,IYEilR 

16 45 PORiliT ( 1 X , *A 00IM4IATE]> AMOUNT* , PI 0 . 2 , *APTEB^ ,12) 

1 7 STOP 

1 S END 


Input data 

)iT6y '5^53000. )55 


LINE NO 

VaRIaBLE/OUTPU t 

VALUE/PRINTOUT 

07 

RriTE 

5.0 (Example) 

03 

AMl'IT 


08 

IMTH 


09 

N 


15 

AOUM 


15 

AMNT 




PROGR/iM 21 

PROGRAM: Conaputat ion of discouni;. 


01 


RR'J)(5,1 )KTYPE,PRIOE 

02 

1 

P0RM/iT(l1 ,P10.2) 

03 


GO TO (10 , 20,30),KTYPE 

04 

10 

IE (PRIOE.GE. 5000.0) GO TO 45 

05 


GO TO 40 

06 

20 

IP(PRI1E.GE. 2000.0) GO TO 45 

07 


GO TO 40 

08 

50 

IE(PRITE.GE. 1000.0) GO TO 45 

09 

40 

DIS0T=0.0 

10 


GO TO 50 

11 

45 

DISCT=0.1*PRIOE 

12 

50 

j J4TDUE=PRI CE-DISO T 

13 


WRI TB(6 , 5 1 )PRI 7E,RTYPE,I)IS0T , AMTDDE 

14 

51 

EOPMiiT< 1 X ^E10 . 2 , 1 2 , 2E1 0 . 2 ) 

15 


STOP 

16 


E14I) 


Inpu t data 

2 ) 1 ^) 6 ) 52000,00 


LIRE ¥0 

V.iRIaLBLE/OUTPUT 

VALUE/PRINTOUT 

01 

PRIOE 

2000.00 (Examp 

03 

KTYPE 


15 

PRICE 


11 

DISOT 


12 

AMTDUE 


13 

OUTPUT 




PROGRiiM 22 
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PROGRiiM: Computation of time in ilM/PM from ’hours' 

01 RE^D(5,1)JRLYTM 

02 1 PORM;i.T(l4) 

03 JHRS=JR1YTM/100 

04 JMI^=JRLYTM-100*JHRS 

05 IP(JHRS.GT.12) GO TO 20 

06 IP(JHRS.RQ.O) JHRS=12 

07 WRITE(6,10)JHRS,JMI1T 

OS 10 FORMAT(lX,I2,*.*,I2,*]rfAM*) 

09 STOP 

10 20 JfIRSr=JHRS-12 

11 IF(JHRS.EQ.O) JHRS=12 

12 WRITE(6,21) JRRS,J]yiIlT 

13 21 F0RMAT(U,I2,*,*,I2,*]rfPM*) 

1 4 STOP 

1 5 END 


Input data 
2300 


LINE NO 

V.iRlABLE/OUTPUT 

ViMJJE/PRINTOUT 

01 

JRLYTW 

2500 (Example) 

03 

JHRS 


04 

JMIN 


12 

JHRS 


12 

JMIN 


12 

OUTPUT 




PROGRi^ 23 


PROGRAM: Galculition of sum of digits in a number. 


01 


RE,iD(5,1 )mJM,NI)IGS 

02 

1 

EORI4Aa](2I6) 

03 


K0UNT=O 

04 


JSUM=0 

05 


NDMIN1=NI}IGS-1 

06 

5 

N1=NIJM/10 

07 


lE(NlJM.EO.O) GO TO 20 

08 


JDIGIT=NUM~N1*10 

09 


IE(JDIGIT,GT,0) JSIIM-JSIJM+JmGIT 

10 


NIM=N1 

11 


K:0UNT=K0UiNT+1 

12 


ip(k; 0 unt.lt.nmini) go to 5 

13 


JSIM=JSUM+NUM 

14 

20 

¥RITE(6,21 )JSUM 

15 

21 

FORMAT (IX, 16) 

16 


STOP 

17 


FiID 

Input d 

ata 

b967S5WW 

LINE NO 

YARIABLE/OUTPUT VaLUE/PRINTOUT 


03 

K0UNT ® (Example) 


05 

NMINl 


06 

N1 (in first iteration) 


OS 

JLIGIT (in first Iteration) 


06 

N1 (in seoond iteration) 


14 

JSUM 
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PROGRAM 31 

PROGR/.M: Computation of first an^ seoond marks of the olass. 

01 I’fTEGER M/IRKS(10) 

02 DO 1 1=1,10 

03 1 REiD(5,2) WiWiS(l) 

04 2 F0RMi\T(I6) 

05 ffflIBSD=0 

06 DO 5 1=1 ,10 

07 IP(Mj\RKS(I).Ga:.MHIEST) MIIIEST=MiBICS(l ) 

08 5 COiNfPINlJE 

09 M2xraHI=0 

10 DO 15 1=1,10 

11 IP(Mi't.HKS(I).BQ.MHIES'r) GO PO 15 

12 IP(l4iiRKS(l).GT.M2iOTIl) M2:roHTr-M4RK.S(l) 

1 5 15 00i4TiyUE 

14 WRITE (6, 20) MHIEST,M2i'1DHI 

15 20 FORMA PC IX, 216) 

1 6 STOP 

17 SilD 

Input data 


40 

IINE NO. 

V.iRIABLE/OUTPUT 

UAIUIi^PRIWTOUT 

20 

05 

MHIE3T 

0 (Example ) 

10 

09 

mrk;s(4) 


18 

14 

MARKS (8) 


90 

14 

MHIEST 


80 

14 

M2NDHI 


12 

14 

OUTPUT 



35 

70 
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PROGRAM 32 

PROGRikM: Conversion of binary number to decimal number 

0 1 RS.a) ( 5 , 1 0 ) JBI TO , JBIT1 , JBIT 2 , JBIT 3 , JBI T4 

02 10 P0M/iT(5I1) 

03 in5C=0 

04 IP(JBia?4.EQ.1 ) I3}EC=IBBG+1 

05 IP(JBIT3.EQ.1 ) II)EC=II)E0+2 

06 IP(JBIT2.EQ.1) IDEC=IDEC+4 

07 IP(JBIT1 .EQ.1) IDEa=IDPO+S 

OS I P( IDEC . GT . 9 ) STOP 

09 IP(JBIT0.EQ.1 )G0 TO 20 

10 WRITE(6,15) IDEO 

11 15 FORMAT (1X,*+*, 11 ) 

1 2 STOP 

13 20 ¥RITE(6,25) IDEO 

14 25 P0m'lAT(lX,*-*,I1 ) 

15 STOP 

16 END 


Input data 


00101 



LIITE K ). 

VARIa RLE/OUTPUT 

vahie/pRii^totjt 

03 

IBEO 

0 (Example) 

04 

IBEC 


05 

IBEO 


06 

IBEC 


07 

IBEC 


10 

OUTPUT 
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PR0GR.J>1 53 


PROGHtM; Oalculation of number of One's in a binary number 
(Parity check), 

01 mD(5,10) JBI'P0,JBIT1,JBIT2,JBIT3,JBIT4 

02 10 P0RMAT(5I1) 

03 iC0UNT=O 

04 I P( JBI T1 . E4 . 1 ) .Ji0UKiE4:0U:^T+1 

05 IP(JBIT2.EQ.1 ) K0Ulfr=4;0Ui’IT+1 

06 IP(JBIT3.EQ.1) K0UHT=:iC0UIT+1 

07 IP(JBIT4.EQ.1 ) K0IINT=.C0UNT+1 

08 N=K0UNa!/2 

09 K0DI)=!C0UW'r-2*N 

10 JBITP=0 

11 IP(K0D]).T5Q.1 ) JBITP=1 

12 IP(JBITO.EQ.JBIIP) STOP 

13 WRITE(6,15) 

14 15 F0R'^T(1X,*PARITI ERROR*) 

1 5 STOP 

16 END 


Input data 


01010 



LINE NO. 

VARIABLE/ OUTPUT 

VAIUE/PRINTOIIC 

03 

1C0UNT 

0 (Example) 

05 

JBIT2 


06 

JBIT4 


08 

N 


09 

I0DD 


12 

JBITP 
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QUIZ lA 303 

ROLL NO, NAME: 

Part B 

Time = 15 Minutes 
Marks = 15 

This part of the quiz is to test your understanding 
of the program. If you have understood the program in PART A, 
it should not be difficult for you to recall the same program. 
Try to recall the program from your memory and write it in 
the space given below. It is not necessary to remember exact 
variable names or statement numbers as long as the logic of 





appendix g 


FOUR-FACTOR EXPERIMENT 

INSTRUCTIOi^S 

This experiment is being conducted to study the factors 
affecting prograra co>Tiprehension* The experiment is divided into 
four sessions, en-h of two hours duration. In each session 
you will wor^: on four i'^dependent programs spending half hour 
on each program. There will be a 15 minutes break after two 
programs. As each program is given to you, pl^se read the 
program careful Ly and try to understand it. You should try 
to find out (i) what the program does and (ii) how it does. 

You will be given 15 minutes for this work. After 15 minutes, 
we shall take back the program from you. In next 10 minutes 
you have to reoonstruct/recall the program which you have 
read. If you have understood the program it will not be 
difficult for you to reconstruct it. It is not necessary 
to use same variable names and statement numbers, which 
has appeared in the program, as long as logic of the program 
remains the same. At the end of the reconstruction task 
you write in English about the program. So in total of 
30 minutes, you spend 15 minutes in program reading and 
understanding, 10 minutes in reconstruction and 5 minutes 
in commenting. Thi s process will be repeated for other 
three quarters of the session with different programs. 

If you find any error in the program please do point it out 

m.. vppti itou fresh tea will he served in break, 
in your comments. To keep you ixeou 
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PROGRAM FOR FOUR-PA OPOR EXPERIMENT 


PROGRAM 01 


C 


01 


10 

15 


20 

25 


50 

40 

45 


3=24 C=05 D=02 E=06 REP PM 576 

y/iL lULATION OF i'fUMBER OF ¥ORDS IN SENTENCES IN A PARAGRAPH 
DIMENSION iaHAR( 300 ) ,NCHAR( 20 ) 

DATA N CHAR/ 20 * 0 / 

DATA IST 0 l%Ii 3 Li^K/ 1 H. , 1 H / 

READ 10 ,I 0 HAR 

FORI'IATCsOaI ) 

1 = 1 

IF(I.GT.SOO) GO TO 40 
IF(rCHAR(l),NE.I' 3 MK:) GO TO 20 
1 = 1+1 
GO TO 15 
NOOHR^I 
1 = 1 + 1 

IPd.GT.SOO) GO TO 30 
IP(ICHAR{I).BQ,I 3 LNK) GO TO 30 
IP(ICHAR(I).EQ.IST 0 P) GO TO 30 
N 0 CHR=N 0 CHR +1 
GO TO 25 

N CHAR ( NOCHR ) =NOH.AR (NO CHR ) +1 

1 = 1+1 i 

GO TO 15 I 

PRINT 45 ,( 1 ,^^®!^^^^)'^='''^°^ f 

F 0 Rj 4 AT( 1 Xd S.NO FRF>Qra 0 YV 2 O(tXd 5 , 6 X,l 3 /)> I 


STOP 

END 



V vJ O 


PROGRAM02 


02 S=:2B 0=03 D«0 E»12 PM 3S1 

OO'iP’ITA'PION OP AVE^\GP NO OP SECTIONS PER COURSE IN A 
'■3T]'ART’'1R<T Al-ro AVERAGE FERCEi'fTAGE OP SEAT VACANT IN EACH 

INTEGER DEI T, COURSE, I>EPT0,C0URS0, SECT, REQ, MX 

Dn TA i(ST0P/1H*/ 

HMD 10,DK1 T, COURSE, SECT, RSQ, MAX 
10 FrHmT(A4,l2,I2,2I5) 

15 T3ECT=1 
Dll’! TOnDia T 
:onRGO»C01JRSE 
A lOHGr^l 
NSE1T=.1 

31‘CT=PL0AT ( REQ )/PL0AT(lMX) 

20 REaD 1 0 , DEI' T , COU R 3E , SECT *REQ ,MAX 
EP(C0UR30,HE. COURSE) GO TO 25 
N3E CT=iJSECT+1 
ISECT«ISECT+1 

SI CT^GPCT+PLOAT ( RIQ )/PL0AT (MX) 

GO TO 20 

25 IP(D.EP'TO.NE,DEPT) GO TO 35 
NC0RS=NXRS+1 
\VG=.100.*SPCT/FL0AT(NSECT) 

PRINT 30,D:^T0, COURSO,AVG 
30 POaMAT(U,A4,2)C,l2,3X,P8.2) 

COURSO=COURSE 
GO PO 20 

35 SiiVG=FDOAT(ISEOT)/PLOAT(NCORS ) 

PRINT 40,DEPT0,SAVG 
40 PORMAT(U,M,2X,PS.2) 

IP(D“!PT.EtJ.KSTOP) STOP 
GO TO 15 



I RCXiRAM 03 
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C 03 S=27 :^06 B=05 E=06 REP M 133 

0 Ui DATLii/ IHE STO 3K- BOO.C WITH iW ARRITaI AilD SHIPMENTS 

ISTK(10),I4U.U^(10),UNIT(10),PRIIE(10),SAMNT(10) 
RE. I D 10,(1 STK ( I ) , I ( I ) , W IT ( I ) ,PRI SE( I ) , 1= 1 , 1 0 ) 

10 POR!-l.T(2r5,2PQ,2) 

15 20 .i^3TiC,iiJUiiN,UPRI0 

20 FOR!-UT(213,FB.2) 

JF(**:S?K,EQ.O) CiO TO 30 

DO 25 r=:i,io 

IF(.n"'K.JE.ISTK(I)) GO 'TO 25 

T ■ J ,iN ( I ) *.» 1 01 1 AR ( r ) +NQUAN 

I-RT 'K(T )=PRI'1E(I)+UPHI7*PL0AT(NQUM) 

UN IT ( I ) =?RIOF( I )/PLOAT(IQIJAN (I ) ) 

25 70WTINUE 

on TO 15 

30 RPiiB 35,NSTK,NQUM 
35 F0mMT(2I3) 

IP(N3TK.E0.0) GO TO 45 
DO 40 1 = 1 ,10 

IP(N3TK.NE.ISTK:(I)) go to 40 
SAI4ji1 T( I ) =UNIT ( I )*PDOAT (NQUAN ) 

IQU.U4 ( I )=IQUAi< ( I )-i'fQUAN 

PRI TE( I )=PRI JB( I )-UNIT (I )*PLOAT (NQUM ) 

40 OOi^TINUE 

GO TO 30 i 

45 PRINT 50,(lSTiC(l),IQUAi^(I),UNIT(l),PRI7F(l),SAMNTtl).I=1.-K^ 

50 P0RW.T(10(1X,I3,2X,I3,3(1X,PS.2))) 

STOP * : 

END 
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HfaiTW-! 04 


04 


10 

1‘3 




'>0 


40 


‘>n 


S = 27 0=04 D=05 E=10 REP ME 
nJ.i LYSIS OP STUDENTS BIO-DATA 

I NTEGER aOLNO , aGE , OaST , STATE , DEGRE , SEX ,I4ARD , SAGE , 
SEOODC 2 ),7S00D(2),M.iRTL(2),ER0V(2O),S0HIM(3) 
a.TA AUM , SAGE, SECOD, OSCOD,mRTL ,PR0V , S OHEM/31 *0/ 
.■(F...D 15 ,ROk<0, DEGRE, AGE, SEX, 1AST, STATE, mRD 
?OHi'InT(l3,I1 ,12,211 ,12,11 ) 
tF(R0LN0,S4.0) GO TO 30 
IP(DEG {E.EO.O.OR,DEGRE.GT.3) GO TO 50 
IF(3KX.E'4,0.0R.SEX.GT.2) GO TO 50 
rF(H0L40.GT.999) GO TO 50 
.UiM=AlJM+1 
OaGE-SuGE+AGE 
OH 10D( SEX )=:SEC0D(SEX)+1 
r/XjD( SAST)=OSOOD{ 0AST)+1 
"ITL( :4iiRD) =MRTL (mSD ) +1 
3 1 0 V ( S TATE ) =?ROV ( STATE ) +1 
0 0 'IFIW ( DEGRE ) =S 1IEIJI( DEGRE ) +1 
'lO 'lY) 10 

avgag»float(sage)/eloat(num) 

AVGM»1 00.*FL0AT (SE00D(1 ) )/FLOAr(MUM) 
viVGPai oo.*float(segod(2))/ploat(niim) 

. :RTNT 35,HUM,A7GM,A7GF,AVGAG 
K1‘.1'*1AT(U,I4,3(2X,ES.2)) 

3 RI.^T 40,m,(lSCOD(I),MARTL(l),I=1,2) 

FnH,4HT(1XI,I4 ,2X,2(I5/ )) t a 

iHlAT 45,^1014, (PR0V(I),I'=1. 20), (SGHM(I), - , 
F0H;4 .iT ( 1 X , 14 , 2X , 20 ( 13/ ) , 3 ( 13/ ) ) 


OTOV 

F-iD 


Q5 s=26 %06 D=03 E=05 KSI HOi^E 
•;} DATUG OF LIB. STOCK 
T)DlFASIOi^ DPTL(5) 

T.J?i=r^FA DCOD,SCOD,T)Enoi)(5),SUBCOD(5 ,10) ,ACONO 
f*TOTAL,DrTL,D]i!PXD,SUBC0I)/6*0. 0,55*0/ 

HRvD 15, ACCr^O,NOVOI, PRICE, I)COD,SCOI) 

15 F^R>1AT(I4,I2,F8.2,2I2) 

rF(ACCiiO.E.).0) GO TO 30 
I F(ACCAO.GT.g999) GO TO 20 
IF(An/CL.E,!.O.OR.AOVOL.GT.99) GO TO 20 
IKd’RTCE. EO. 0.0. OR. PRICE, GT. 2000.) GO TO 20 
'I’l HI CE4^RI0E»PL0ATU0V0L) 

(JTOTAL^^GTOT.vL+TPRICE 
rF(I):0i).Esi.O.0R.DC0D.GT.5) GO TO 20 
Dl TL ( D COD ) =DPTL (DCOD) +TPRI OE 
DEI' COD( DCOD) =D1R COD( DCOD ) +i10V0L 
IF(SCOD.EQ.O.OR.SCOD.GT.IO) GO TO 20 
no .nOD( D COD , S cod ) =SUB C0D(D CID , SCOD )+i'^OVOL 
GO TO 10 

20 IHIAT 25,.iCCi10 

?S FORM.i'"( IX, TERROR Ix^ DATA FOR 100110=*, I6) 

•CO TO 10 

30 PRIGT 35,GTOTAL 

35 FnmUT(lX,*GR.'UlD TOTAI=* ,P8 . 2) 

Vil.lT 40,{BEPXB(I),I=1,5),((SUBT01 i(I,J),J=1,-IO),I=1.5) 

f.il Ki WATdX, 5(15, 2'«, 10(15, 2X)/) 


1 HfjGflw-I 06 
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7 06 3 = 26 ^=07 D=01 1=12 HEP PWM 378 

Z OOI'll UTATlOiJ OP THE AVERiiGE i'fO. OP WORDS lA THE SEWTMCE 

mimsiOii lA(so) is para. 

Mil I3 LiJK,IPER/1H ,1H./ 

.,WD.3=0 

•I =0 

DO .10 1=1,10 
r.iD 10, lA 
H F0it-i;iT(90Al ) 

.N-rO 

J=1 

IS IP(i;i(,l).*^l.IPER) GO TO 20 
1 F ( . il 1 , 0 ) if •rDS=i'fWDS+1 
.l3Bi=NSlH+1 
GO TO 30 

20 lF(rA(J).E3.IBLi'iK} GO TO 25 
JW=1 

GO TO 30 

.5 IF(JV/.EM) MS=MS+1 
JWiO 
50 

IF{J.LT.80) GO TO 15 
IP( JW .14.1) AWDS=i'lWDS+1 
40 OOaTIAUE 

..VG=FLOAT (iJWDS )/PLOAT (ASEN ) 
paii^T 45 ,hVG 
•Hi F0R:i*T(1^,F3.2) 

STOP 

liJD 
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: l.O? D=05 E =07 R^P PM I 95 

1 •::!£ 1 RnG:i.'vM Ei'IGIPHERS (copes) a message. 

r .iCEGKH KEY ( « ) ,MESAG( 16), S Y!'1B0L( 2? ) ,NKEY( 8 ) , CRYP (16) 

.• . lA .dYM 80L/1H »1Hii,1.:B,1HC,1HD,1HE,1HP,1HG,1HH:,1HI,1HJ,1HK, 
1 ^•^^-.‘'■'••^»lfii'J,1HO,lHP,1HQ,1HR,1HS,1HT,1HU,1HV,1HV7,1HX,1Hy,1HZ/ 
AK.i.O 10,KEY,MESAG 
U' E ' t:AT(‘-1ii1/16ii1 ) 

:/ 20 

1 / 15 .< == 1,27 

IK(KKTfJ).E\^SYlBOI(K)) GO TO 20 
IS 

:'C loo 

.1^0 

IK) 5S 1^1,16 
4.4+1 

IF{J,E4.9) J=1 
DO 29 

IF(MK4,vG(L).E'J.3YM30L(K)) GO TO 50 
/S :o.<TT.-K;rE 
on "'0 100 
50 ;‘;.»K+AKEY{J)+9 

M M-27*(M/27) 

5S ■:RY1’(L)=SYM‘30L(M+1) 

JHIii'P 40 , 4RYP 

1 0 FC'jiM.-iT {Mf* CRYPTOGRAPH* , 5^ , 1 6A1 ) 

1 n 0701 


iirjGM 08 
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7 08 :U21 3=07 D=05 1=06 RIP imu 

i ROORiWil 00Ui<T3 THE OP SBIBOLS MD ITS PREQUMCY 

3 lOi'l I7HAR(300) ,ISIMBL(27) ,IPUH0(3) ,IPREQ(27) ,IPC01TT(5) 
: .TA XSra3L/1H 

D<'iTA H'UWC/1H,,1H. ,1H-/,M,IPRSQ,IPOOiIT/1 ,30*0/ 

READ 10,I7HAR 
10 FOWiT(10(B0A1/)) 

15 TF(AUM.GT.QOO) GO TO 40 

1 taO 

DO ?0 T = 1 ,27 

IP( IO'IAR(dllM).ilE.ISYMBL(l)) GO TO 20 
ISWasI 

IF «E^{I)=IFRED(I) + 1 
.’•'J XAOXliiUE 

IF(ISW.E.i.1) GO TO 35 
DO 25 1*1 ,3 

IF(I mR(AUi4).Hl.IPUi)fO(l)) GO TO 25 
I.‘5W = 1 

ri'30*jT(I)=IPOOAT(I)+1 

1F( ISW.AE.O) GO TO 35 
4 HL<T 30, lOHARCimM) 

1.0mT(U,*WV.LLID ■■mR*,2X,Al) 

35 .d;:4=aum+i 

GO TO 15 

l.UJT 'l5,(IP>WJ(I).I=''>2T),(IP'TOiT(I.),I-1,3),HOM 

?oi«AT{U,27(1X,lV).'l^>5(''^>I5/),1X,I4) 

ATOP 


PHnOHii :-! 09 
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^ j *) 0 = 59 /=05 DaOl E =06 REP DVT 215 

* 1 ■'ICnjRiiM FOR COMPUTING ¥ATER OOICPARY BILL 

D L S I OXi ( ililAME ( 5 ) 

10 ' ii-jvD "iS » ( iUlME ( l ) , 1=1 ^ 5 )^ x \ f 0 UST , MRI,MM 
15 i *' ORi 5 .. T ( 5 ii 4 , 3 I 4 ) 

IF (* nU 3 T . Eij . O ) GO TO 90 
1 ’lIoT 20 ,(. alMS ( I ), I = i , 5 ) 

20 ?0 it'UT (IX, , 3X,5ii4 ) 

IF(iMU3T.GT.1000) GO 10 60 
i'ATOT 25 , iTOl]ST 

.‘5 F ') HM . T ( lX ,-* 0 US 7 . iTO .*, 2 X , I 5 ) 

lF ( MrtL , aT . 9909 } GO TO 70 
j u.i r 50 ,MRL 

5 i ^ ? n : t - r * T ( 1 X,*UST meter RMIiTO *, 2 X , I 5 ) 
IF ( MR .<. LT . MRL . OR . MRL ' f . GT . 9999 ) GO TO 80 
VRIuT 

!- T ; lMAT ( lX,»NW METER mDIMG *, 2 X , 15 ) 
rnOED « iMRIi-MRL 
I U.iT 40 ,IUSED 

•’.0 FO R -' L.T ( IX , » AMOUi^T USED ^ , 2 X , 15 ) 

IF(IIJS D . GT . IOO ) GO TO 45 
1 = 0 . 1 * F 10 iiT ( lUSEI ) 

m 4 iJT ?: 30.0 

.0 TO 50 

A . WT 1 = 0 , 1 * 100.0 

;.:.HT2=0. 15*EL0AT(IUSED-100) 

Of ;.M<T=.iMi^T1+AlWT2 

i HIaT 55,^iMx^Tl ,iiMi1T2,AMi1T 

05 FuH : 4 ii . T ( U ,* T 0 TAl AT RATE 1 *, 2 X , P 6 . 2 / 1 X ,* T 0 TAL AT SATE 
1 F 6 . 2 / lX ,* T 0 Li.L ■; iURGE *, 7 X , E 7 . 2 ) 

GO TO 10 


(continued) 


PROGRAM 09 (oontinued) 


60 PRINT 65,N0UST 

65 P0RMAT(1X,*PRR0R IN OUSTOMER NO.*,2X,I5) 

GO TO 10 

70 PRINT 75 ,MRI 

75 PORMAT (I X, *ERROR IN LAST METER READING* , 1 X , 15 ) 
GO TO 10 

BO PRINT 35, MEN 

85 PORMAT (IX, *ERROR IN NEW Z4ETER RPADMG* , 1 X , 15 ) 
GO TO 10 
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PROGRAM 10 

0 10 S=40 0=06 D=03 E=10 REP PWM 190 

0 PROGRAM TO PLOT A GRAPH OH PRIITER 

DIMERS lOH X(20),Y(20),IP(50) 

DATA IA,IB/1H*,1H / 

READ 10, N 
10 P0MAT(I2) 

PRIRT 15,R 

15 P0RMAT(1X,'*R0. op P0IRTS*,2X,I3) 

READ 20,(X(I),Y(I),I=1 ,R) 

20 PORtIAT(2P4.1) 

PRIRT 25 
25 P0RMAT(1H1 ) 

1=40 

30 DO 35 K=1,50 
IP(K)=IB 
35 OORTIRUE 
DO 40 J=1 ,R 
IY=Y(J)/2.5+0.5 
IP(IY.RE.I) GO TO 40 
IX=X(J)*10.0+0.5 
IP(IX)=IA 
40 OORTIRUE 
M=I/10*10 

IP(M.EQ.I) GO TO 50 
PRIRT 45, IP 

45 P0RMAT(11X,1H*,50A1 ) 

GO TO 60 

50 YP=2,5*PL0AT(I) 

PRIRT 55,TL,H> 


(coatiQued) 



PROG-RAM 10 (continued) 


55 P0Riy[AT(lX,P6.2,2X,*+*,50Al ) 

60 1=1-1 

IP^I.GT.O) GO TO 50 

irp=OTO 

PRI^T 65, IP 

65 PORMAT ( 1 X , PS . 0 , 2X , *+* , 5 ( 1 0H**********+ ) ) 
DO 70 1=1 ,6 
X(I)=I-1 
70 CONTINUE 

PRINT 75,(X(I),I=1 ,6) 

75 P0R1^AT(2X,6PS.1 ) 

STOP 
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PROGRAJyi 11 


C 11 S=44 0=03 I>=05 E=07 REP ttOITE 

C PREPARING LIST OP STIIDMTS IN EACH DEPARTMEilTS OP INST. 

LMEi'i'SlON LEPT(5 ) ,ANAMS(5 ) ,N0DPT(5 ) ,N0PRG(3) ,NPIN(3) 
MTA ATENL ,i40LPT , ISM,M/1H*, 7*0/ 

10 READ 15,(DEPT(I),I=1,5) 

15 PORnAT(5A4) 

IP(DEPT(1 ) .EQ.ATEj-ID) GO TO 90 

M=M+1 

PRINT 20 

20 P0RMAT(30X,*LIST OP STUDEtJTS*) 

PRINT 25, (DEPT (I), 1=1, 5) 

25 P0RMAT(10X,5A4) 

DO 30 1=1 ,3 
N0PRG(I)=0 
30 NPIN(I)=0 

35 READ 40 , JROL , (ANAME( I ) , 1=1 , 5 ) , IPOOD , IPOOD 
40 P0MAT(I6,1X,5A4,3I1 ) 

IP(JROL.EQ.O) GO TO 50 
NODPT (M ) =N'ODP T (M ) + 1 
NOPRG(IPGOD)=NOPRG(IPOOD)+1 
NPIN ( IPCOD)=NPIN ( IPOOD) +1 

PRINT 45, JR0I,(ANAME(I),I=1 ,5 ), IPOOD, IPOOD 
45 P0RMAT(I7,3X,5A4,3X,I1 ,3X,I1 ) 

GO TO 35 

50 PRINT 55,N0DPT(M) 

55 PORMAT(1X,*TO'IAI no. op STUDEL'fTS=*,I3) 

PRINT 60,N0PRG(1) 

60 P0RMAT(1X,*N0, OP STUDENTS IN B.TEOH .=*,13) 

PRINT 65,N0PRG(2) 

65 P0RMAT(1X,*N0.0F STUDMTS IN M.TE0H.=*,I3) 

PRINT 70,N0PRG(5) 


(continu ed) 
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PROGRAM 11 (continued) 

70 FORMA. T(1X,*R0. OP STURMTS IN PHD=*',I5) 

PRINT 75,X'rPIN(1) 

75 FORMAT (1X,*N0 OP STURMTS GETTING INST .SOHO LARSHIP=*, 14) 
PRINT S0,NPIN(2) 

80 P0RI4AT(1X,*N0. OP STUREiXTS GETTING UGO S0H0LARSHIP=*,I4) 

PRINT 85,NPni(3) 

85 FORMAT (1X,*N0. OP STURENTS HAYING OTHER PIN,SIIPP0RT=*,I4) 
GO TO 10 
90 DO 95 1=1,5 

95 ISUM=ISTJM+NORPT(l) 

PRINT 100 

100 FORMAT (1X,*T0TAL NO. OP STIIDENTS=* , 17) 

STOP 

END 
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PROGRAM 12 

G 12 S=43 0=05 I>=05 E=1 2 REP YR 169 

G QUESTimiUIRE MALY SIS 

INTEGER SEX(2) ,DIGRI(5),SI]MAGE,AYAGE,SERL'IO,SGOLE,AGE,OODIR, 

1 LEGGOI>,EALvGAR 

LIMElISIOi^ IRST(7) ,MRT(5) ,IJ-^SIRT(5,7) 

LATA IL S T , liM'TRT , IRSM'T , BALOAR , DIGRI , SLMAGE/ 5 4*0/ 

SEX(1)=0 

SEX(2)=0 

il0QS=0 

1 0 READ 1 5 , SERNO , SCORE , AGE, IRSOOL, OODI14 , DEGGOL 
15 P0RMAT(I3, 11 ,12,311 ) 

IP(SERi'fO.EQ.O) GO TO 30 
IP(S00DE.EQ.0.0R.S00LE.GT.2) GO TO 20 
IP(IRSCOL,EQ,O.OR.IRSGOL.GT.7) GO TO 20 
IP(GOLIi4.EQ.O,OR.GODlR.GT.5) GO TO 20 
IP(LEGOOL.EQ.O.OR.LEGOOL.GT.5) GO TO 20 
SEX ( S CODE ) =SSX ( S COLE ) +1 
IN S T ( IRSG OE) =Ii'f ST ( IMS GOD ) +1 
LI GRI ( DEGGOL )=DIGRI (DEGGOL ) +1 
IM S IM T ( GODIM , IM S GOD ) =LI S IMT ( 0 ODIM , IMSGOD ) +1 
MOQSMOQS+1 
S JMAGE=SUMAGE+AGE 
GO TO 10 

20 PRIMT 25,SERM0 

25 P0RMAT(1X,*ERR0R IM GARB- N0,*,2X,I4) 

BADGAR;=BADGAR+1 
GO TO 10 

50 PRIMT 35 ,MOQS ,BADGAR 


(continued) 
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PROGRAM 12 (continued) 

35 P0RMAT(1X,10X,*1T0.0P GOOD OARDS=*,I3/1X,*RO.OP BAD 0ARDS=*,I3) 

PRINT 40,SEX(1 ),SEX(2) 

40 P0EMAT(1X,*N0.0P ]y[ALE=*,I3/1X,*N0.0F PEMALE=*,I3) 

PRINT 45, (B'fST(l),I=1 ,7) 

45 P0EMAT(1X,*1*,3X,I3/1X,*2*,3X,I3/1X,*3*,3X,I3/1X,*4*, 

13X,I3/1X,*5*,3X,I3/1X,*6*,5X,I3/1X,*7*,3X,I3/) 

PRINT 50, (INTRT(I),I=1 ,5) 

50 PORMAT(lX,*SGI*,3X,I3/1X,*Ex1G*,3X,I3/1X,*MATH*,5X,I3/ 

11X,*S0G*,3X,I3/1X,*BUS*,3X,I3) 

PRINT 55,(DIGRI(I) ,1=1,5) 

55 PORMAT(lX,*]yiATR*,3X,I3/1X,*ISC*,4X,I3/1X,*BSO*,4X,I5/ 

11X,*MSC*,4X,I3/1X,*PHD*,4X,I3) 

AVAGB=SIJMAGE/N OQS 
PRINT 60,AVAGE 

60 POmAT(lX,*AVERAGE AGE=*,P8.2) 

STOP 

EN'D 
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PROGBAM 13 


0 15 S=38 0=08 D=05 E=07 REP 213 

0 PROGRAM 70 OHEOK BAilR ACOOUl^fT BALMOE 

DIMMSIOR BMO(10) ,I0HARG(10) 
lii TEGER AGRO (10), ARAME( 10,5) , AGRT 
DO 5 1=1,10 

READ 10,AGR0(I) ,(At'iAME(l,J),J=1 ,5 ) ,8^0(1) 

10 PORiyiA7(l2,5A4,P10.2) 

5 CORTMUE 

PRIl'JT 15 ■ ' 

15 PORMATdHI ,20X,*TRAL1SG. RECORD BEGIR*) 

DO 20 1=1 ,10 
20 ICHARG(I)=0 
25 READ 30,A0R'T,TAMT,IG0D 
30 P0RMAT(I2,P8.2,I1) 

IP(AGi'lT.EQ.O) GO TO 75 
DO 65 1=1 ,10 

IP(AGNT.NE.AGilO(l)) GO TO 65 
PRIx’IT 35,ACN0(I) ,BMG(I) 

35 P0RMAT(1X,^3,2X,P10.2) 

IP(IOOD.EQ.O) GO TO 50 
IP(BLRG(I) .LT.TAM1.4T) GO TO 40 
BLR G ( I ) =BLi'l 0 ( I ) -TAM T 

GO TO 55 

40 BLi40(l)=BLi'JG(l)-3.0 

PRIRT 45,Aai40(l) 

45 P0RI/[AT(1 X,I5 , 1 0X,*GHEQUE ROT HOR CURED BAD 'GHEQUE CHARGE DED.*) 
GO TO 70 

50 BLRa(l)=BLRC(l)+TAMRT 
55 igharg(i)=lgharg(i)+i 


(coAtiaued) 
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PROGRAM 13 (oontinued) 


IP(IGHARG(I).GT.10)I0HAR&(I)=10 
POTT60 ,A0Ero(l ) , Bn-TOd ) , TAMT 
60 PORjy[AT(1X,I3,5X,R10.2,5X,P10,2) 

65 CORTIilUE 
70 GO TO 25 
75 PQ SO 1=1,10 

SO BM C ( I ) =BLl 1 0 ( I ) -0 . 5 *PLOA T ( lOHAR G ( I ) ) 

DO 100 1=1,10 

PRIi^T 85,ACilO(l) , (Ai'lAME(I,J),J=1 ,5) ,BMG(l) 
85 P0HyLAT(l2,5A4,R10..2) 

100 COiiTIAtUE 
PRIMP 90 

90 PO miiP (IX,* TRAM SAOTIOM OLOSED* ) 

STOP 

EilD 



PROGRAM 14 


0 14 S=41 0=10 D=02 E=12 REP PM 379 

0 PROGRAM PRIRTS THE INTEGER RUI4BER PROM A CARD 

PIMEAS lOR lEIG ( 1 0 ) , lA ( SO ) 

DATA IBIi'JK,IPIUS,IMIi'lUS,IDIG/1H ,1H+,1H-,1H0,1H1 , 
11H2,1H3,1H4,1H5,1H6,1H7,1H8,1H9/ 

RE/iD 10, lA 
10 POEMAT(SOAI) 

PRIRT 15,IA 
15 P0RMT(1X,80A1 ) 

IDG=0 

RUM=0 

J=0 

20 J=J+1 

IP(J.GT.80) GO TO 50 
IP(IA(J) .EQ.IBLRK) GO TO 20 
ISIR=+1 

IP(IA(J) .RE.IMIiWS) GO TO 25 

ISIR=-1 

J=J+1 

GO TO 30 

25 IP(IA(J).EQ.IPLUS)J=J+1 
30 IP(J.GT.SO) GO TO 60 
DO 35 1=1 ,10 

IP(IA(J).EQ.IDIG(I)) GO TO 75 
35 COR T TRUE 

IP(lA(J).i1E.I3IRK) GO TO 65 
IE(IDG.EQ.O) go to 65 
40 i^IM=RUM*ISIR 
PRIRT 45, RDM 


( Goatinued) 
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PROG-RAM 14 (contirrued ) 

45 rORMAT(1X,*VALUE=*,2X,I10) 

GO TO 100 
50 PRm 55 

55 FORMAT (1X,*BLARK CARP*) 

GO TO 100 

60 IF (IPG .RE. 0) GO TO 40 
65 PRIRT 70,IA,J 

70 FORMAT (6X,30A1/6X,*IRYALIP CHARACTER imR COL.*, 15) 
GO TO 100 
75 i’IUM=R'ra*1 0+1-1 
IPG=1 
J=J+1 
GO TO 30 
STOP 
Ei’JD 


100 
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PROGRA.M 15 

0 15 3=45 G=10 11=07 E=04 HEP xlOi'JE 

C PRIxMTIi'I'G LIST OE STUlEi.'iTS OE 1EPARE4MT 

IIMUmSIOR 1EPT(5) ,ALAME(10, 5) ,11001(10) ,JR0L(10) , 
1IEC0l(10),Si^lME(5) 

IiMTEGER ROI, PROG, ERG 
10 130 1=1 ,5 

GO TO (10,20,30,40,50),! 

10 PRIRT 15 

1 5 EORMAT ( 1H 1 , 30X , *1EP T . OE AEROR AUT I GAL M GG . * ) 

GO TO 60 
20 PRIRT 25 

25 FORMAT ( 1 HI , 30X , *DSPT . OE GHEMI GAL Eli GG . * ) 

GO TO 60 
30 PRIRT 35 

35 FORMAT ( 1 H1 ,30X,*DEPT. OP GIYIL ERGG.*) 

GO TO 60 
40 PRIRT 45 

45 FORMAT ( 1H1 ,30X,*1EPT.0E ELEGTRIGAL ERGG.*) 

GO TO 60 
50 PRIRT 55 

55 FORMAT (1 HI ,30X,*1EPT.0E MEOHAiaGAL ErGG.*) 

60 JK=0 

10 75 J=1 ,10 

REAL 65 ,R0L,(SRA1LE(I),I=1 ,5) , PROG, ERG 
65 E0RMAT(I6,1X,5A4,2I1 ) 

lE(ROL.EQ.O) GO TO SO 
PK=JK+1 
DO 70 K=1,5 


(coRtinued) 
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PROGRAM 15 (continued) 

70 A1TAME( J , K ) =SRAME ( K ) 

JR0L(J)=R0L 
IPGOI>(J)=PROG 
75 IPaOD(J)=Pi7C 
80 po 90 M^1 ,jk: 

IP(lP00D(M).i'^E.1) GO TO 90 

PRli^T 85 , JROL(M) , (Ail AME (M ,K ) ,K=1 ,5 ) , IPOOP(M) , lEaOD(M) 
35 P0RMAT(1X,I7,2X,5A4,3X,I2,3X,I2) 

90 COi'iTiUUE 

PO 100 R=1 ,JK 

IP(lPG0P(i'l).-L^E.2) GO TO 100 

PRIiST 95 , JROL(R ) , (Ai'lAME(i'l ,K) ,K=1 ,5 ) ,IPOOP(l'f ) ,IPG0P(R ) 
95 PORl'iAT(lX,I7,2X,5A4,3X,I2,3X,l2) 

100 COx'^TIRUE 

PO 120 1=1 ,JK 

Ip(lP00P(L).iNE.3) GO TO 120 

PRIi^T 1 10, JROL(L) , (Ai')AME(L,K) ,K=1 ,5 ) ,IPGOP(I),IEGOP(P) 
110 E0BMAT(1X,I7,2X,5A4,3X,I2,3X,I2) 

120 COi'tTIi'lUE 
130 GOi^TIilUE 
STOP 
Ei'fP 
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PROGRAM 16 


G 16 3=52 0=13 P=10 E=14 REP VR 163 

0 STUESl'ITS grade AiULYSIS 

DIMSi'iSIOi.M AilAME(7 ) ,MARKS(6 ) , ISUB (6 ) , ISUAV(6 ) , IG07 (6 , 6 ) , 
1C07(6,6),I3R0P(6,6),00RR(6,6) ,TDE7(6) ,SDEV(6) 

DATA i>lSTUD , ISUB , lOROP ,ATLAST/43*0 , 1 H*/ 

10 READ 15,(Ai^AME(l),I=1,7),(mRKS(l),I=1 ,6) 

15 FORMAT (7A4, 61 3) 

IF (Ai'iAME(l).EQ. ATLAS T) GO TO 60 

RSTUD=iMSTUD+1 

MARTI =0 

DO 20 1=1 ,6 

MARTI=MARTL+MARKS ( I ) 

20 ISUB(I)=ISUB(I)+MARKS(I) 

AVG=FLOAT (MARTI )/e. 0+0 . 5 
DO 25 1=1 ,6 
DO 25 J=1 ,6 

25 IGRCPd, J)=IGR0P(I,J)+MARKS(I)*MARKS(J) 

IF(AYG.LT.60.0) GO TO 30 

k:div=i 

GO TO 50 

30 IP(AVG.IT.50;i GO TO 35 
KDIV=2 
GO TO 50 

35 IF(aVG.LT.40.0) go TO 40 

KDIV=3 
GO TO 50 

40 PRIilT 45 , (Aj-'*AI’ 1E(I) ,I=d7) ,A^<^ 

45 FORMAT (1X,7A4,2X.5’5* 2, 2X,6HFAILED) 

GO TO 10 

50 puli'! T 55 , (•^^AME(I) ,I=d7) ,A'7G,OI7 
55 format ( 1X,7A4,2X,R5. 2, 2X, *PAS SED li'i *,I1) 


(coiiti nu ed) 
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PROGRAjyi 16 (continued) 

GO TO 10 
60 DO 65 1=1 ,6 
65 ISUA7(I)=ISUB(I)/ITSTUD 
PRINT 70, (ISUAY(I) ,1=1 ,6) 

70 I'OBMAT(lX,*PHYS*,3X,*CHiM*,3X,*IviATH*,3X,*T.A.*,5X,*E.SC*, 
1*HIMSV1X,6(I4,3X)) 

DO 80 1=1 ,6 
DO 75 J=1,6 

ICOV ( r , J ) =IOROP ( I , J )/NSTUD-ISUAV( I ) *ISUAV( J) 

75 OOV(I,J)=ICOV(I,J) 

SDE7(I)=SQRT(00V(I,I)) 

SO IDEV(I)=SDEV(I) 

PRINT 85,(IDEV(I),I=1,6) 

35 PORMA T(1X,*STD. DEVIATION*, 1X, 6 (13, 3X)) 

DO 90 1=1,6 
10 90 J=1 ,6 

90 OORR(I,J)=OOV(I,J)/SQRT(OOV(I,I)*OOV(J,J)) 

PRINT 95,((00RR(I,J),J=1,6),I=1 ,6) 

95 EOMAT ( 1 X, *CORD . OOEPE .MATRIX*/ 1 X , *PHYS* , 6E1 0 . 2/1 X ,*OHEM* , 
15E10.2/1X,*MATH*,4P'10.2/1X,*T.A.*,5P10.2/1X,*E.S0*,2E10.2/ 
11X,*HUMS*,F10.2) 

STOP 

END 



APPENDIX- I 
OOMPLEXITY RATIN& 

IMSTRU:;II0N 

¥e are engaged in developing a measure for the complexily 
of programs. This complexiiy measure -will reflect the diffi- 
culty in comprehension of the program. In this experiment 
you have to give your judgement about the program complexity 
from the point of vievj of understanding it. This program 
complexity may he due to size of the program, or its complex 
control structure or its complex data structure or executional 
behaviour or all of them. The experiment will be conducted 
in four sessioti-s each of one and half hour duration* In each 
quarter of the session you will be given different program. 
Please read the program carefully and try to understand. 

After 15 minutes -we shall take back the program from you. In 
next 5 minutes you prepare a brief sutnmaiy of the program. The 
next task in the experiment Is to give your judgement about the 
complexi-ly of the program you read. Tou have to give your 
complexity rating on a scale of 10 points. If you judge the 
program extremly simple you should give 0 rating whereas 
you think program extremly complex you should give a rating of 
10. Please also give the percentage of contributing factors 
to the program complexity. So in total of 25 minutes, 15 minutes 
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you spend in reading and understanding the program, 5 minutes 
in preparing summary and 5 minutes in giving your judgement. 
Thus process -will be repeated for other three quarters of the 
session with different programs. 

If you find any error in the program please do point 
it out in the summary. 


RTMIMBER YOU ARE JUDGING PROGRAM OOMPLEXITY. 
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